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ABSTRACT 


Spatial  and  Temporal  Climate  Variations  Influencing  Medium-range 
Temperature  Predictions  over  South-central  European  Russia.  (May  1990) 
Jeffrey  Edward  Johnson.  B.S., University  of  Colorado 
Chair  of  Advisory  Committee:  Prof.  John  F.  Griffiths 


-  -v-  To  issue  medium-range  forecasts,  forecasters  should  integrate  climatology,  syn¬ 

optic  meteorology,  statistics,  local  topography,  computer  science,  human  experience 
and  judgment.  Often  <Aimes,  if  forecasts  extend  beyond  72  hours,  a  systematic  ap¬ 
proach  is  abandoned  for  a  more  haphazard  one.  The  aim  of  this  research  is  to  reduce 
a  forecaster's  guesswork  and  make  the  forecast  methodology  more  objective. 

A  general  climatology  was  developed  for  south-central  European  Russia. 
Monthly  and  diurnal  trends  are  easily  noted.  One  station,  Astrakhan,  was  selected 
for  a  more  detailed  climatology.  A  monthly  overview  for  9  stations  in  the  region  is 
given  for  the  weather  elements  as  stated  above.  Pentad  and  3-hourly  averages  are  also 
included.  The  detailed  climatology  is  helpful  in  orienting  a  new  weather  forecaster  to 
the  peculiarities  of  a  novel  region. 

Regression  equations  were  developed  using  lag  functions  of  1-5  days.  Low  and 
high  temperatures  for  the  forecast  period  were  regressed  back  to  day  zero.  Low  and 

/!  ‘  I  ■' 

high  temperatures  lare  predicted  for  1-5  days  past  the  initial  forecast  day. 

The  regression  equations  developed  from  some  24  years  of  data  have  a  higher 
percentage  of  correctly  predicting  temperatures  at  the  times  mentioned  than  do 


climatology,  persistence,  or  combinations  thereof. 
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CHAPTER  I 

INTRODUCTION 

Medium-range  military  operations  require  knowledge  of  weather  at  least  several 
days  into  the  future.  Knowing  specific  weather  elements  is  essential  for  planning 
purposes  and  for  successful  operations  (Jacobs  1947). 

The  United  States  National  Weather  Service,  USAF  Air  Weather  Service, 
and  USN  Fleet  Numerical  Central  are  among  agencies  that  currently  issue  regional 
weather  forecasts.  It  is  the  job  of  the  local  forecaster  to  interpret  and  modify 
generalized  forecasts  for  his  or  her  specific  area  of  interest.  To  achieve  tailored  weather 
forecasts,  forecasters  should  integrate  the  following  disciplines  or  topics:  climatology, 
synoptic  meteorology,  statistics,  geography  or  local  topography,  computer  science, 
human  experience  and  human  judgment.  As  in  any  science,  to  achieve  objectivity, 
the  aim  is  to  minimize  human  subjectivity.  To  do  so  requires  a  systematic  approach 
to  the  problem.  Many  people  issuing  extended  forecasts  do  so  haphazardly  or  with 
“gut  feelings.”  The  aim  of  this  study  is  to  reduce  the  forecaster's  guesswork  when 
issuing  medium-range  temperature  forecasts. 

Objective 

The  purpose  of  developing  these  medium-range  weather  forecasting  models  is 
to  allow  a  forecaster  to  issue  a  “5-day  temperature  forecast”  objectively  with  limited 
data.  Required  ingredients  for  the  model  are  climatology  for  the  station,  and  today's 

This  thesis  follows  the  style  based  on  the  journal  Monthly  Weather  Review. 


and  the  previous  day's  synoptic  weather  observations.  The  model  will  predict  high 
and  low  temperatures  1  through  5  days  out. 

Basic  Problems 

Two  problems  will  be  tackled.  First,  a  climatology  for  south-central  European 
Russia  will  be  developed.  Second,  a  predictive  model  for  high  and  low  temperatures 
will  be  developed.  The  problems  are  obviously  intertwined. 

Given  an  initial  state  of  the  atmosphere,  the  fundamental  problem  of  numerical 
weather  prediction  is  to  determine  a  future  state  by  integrating  the  system  of  hydro- 
dynamic  and  thermodynamic  equations  which  describe  physical  processes  occurring 
in  the  atmosphere.  The  complexity  of  processes  these  equations  represent  offer  great 
difficulties  in  direct  analytical  solutions.  The  application  of  climatological  records  to 
the  forecast  procedure  has  been  rather  limited.  Also,  the  field  of  climatology  generally 
has  been  viewed  as  offering  little  help  to  the  weather  forecaster.  Actually,  climatology 
is  of  very  great  assistance  inasmuch  as  it  can  be  thought  of  as  the  “experience"  which 
is  perhaps  the  most  powerful  tool  which  the  forecaster  can  bring  to  bear,  especially 
in  medium  and  long-range  forecasting  (Bundgaard  1951).  What  is  required  is  to  or¬ 
ganize  the  climatological  data  in  such  a  way  that  its  usefulness  can  be  substantially 
enhanced. 

The  problem  at  hand  involves  statistical,  meteorological,  and  computational 
(computer)  techniques.  The  statistical  approach  requires  the  use  of  basic  statistics 
and  elementary  time  series.  Meteorological  aspects  were  considered  from  the  view¬ 
point  of  synoptic  climatology,  so-called  because  by  its  very  nature  the  approach  is 
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Figure  1.  Study  Area:  South-central  European  Russia 

climatological;  yet  because  it  deals  with  individual  weather  situations,  it  may  also 
be  called  synoptic.  The  computer  work  used  Statistical  Analysis  System  (SAS),  a 
software  system  designed  especially  for  data  analysis. 

Data 

The  area  of  interest  is  45°  N  to  55°  N  latitude  and  25°  E  to  55°  E  longitude. 
This  area  lies  in  south-central  European  Russia,  north  of  the  Caspian  and  Black  Seas 
(Fig.  1).  Dates  of  interest  are  January  1959  to  December  1986. 

Magnetic  tapes  of  3-hourly  surface  synoptic  data  were  obtained  from  the  USAF 
Environmental  Technical  Applications  Center  at  Asheville.  North  Carolina.  The  data 
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included:  station  identifier,  year,  month,  day,  observation  time;  wind  direction,  wind 
speed,  dry  bulb  temperature,  dewpoint  depression,  station  pressure,  visibility,  sky 
condition,  ceiling  height,  total  precipitation  for  the  past  6  hours,  and  current  weather. 
Nine  stations  were  selected  in  the  above  grid  area  based  on  the  availability  of  3-hourly 
synoptic  observations  and  at  least  20  years  of  record. 
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CHAPTER  II 

LITERATURE  REVIEW 

The  geographical  distribution  and  seasonal  occurrence  of  planetary  and  synoptic 
scale  systems  are  fairly  well  known,  but  on  the  mesoscale  this  is  not  the  case. 
Moreover,  interactions  of  the  larger  scales  of  motion  are  influenced  by  geographical 
controls.  Thus,  the  planetary  wave  pattern  is  related  to  heat  sources  and  sinks  as  well 
as  to  large-scale  orography,  and  the  formation  and  movement  of  frontal  depressions 
is  in  part  governed  by  these  planetary  waves.  Mesoscale  systems  appear  to  be  related 
both  to  geographical  features  and  to  larger  scale  systems  (Hare  1957). 

A  regional  macroclimate  may  be  accounted  for  in  general  terms  by  reference 
to  planetary  wave  patterns  and  particular  regimes  of  synoptic  and  mesoscale  distur¬ 
bances  on  seasonal  and  shorter  time  scales  (Feyerherm  and  Bark  1965).  Synoptic 
climatology  can  provide  a  vital  link  between  hourly  and  daily  climatology  data.  Also, 
by  its  attention  to  long-term  synoptic  variability,  synoptic  climatology  can  play  a 
major  role  in  developing  our  understanding  of  world  climate  (Barry  and  Perry  1973). 

If  an  individual  map  of  the  pressure  field  is  considered,  the  static  pattern  may 
be  classified  into  one  of  two  main  categories: 

(1)  the  identification  of  individual  circulation  features  such  as  high  and 
low  pressure  cells,  and  ridges  and  troughs  in  specific  locations,  and 

(2)  a  description  of  the  complete  pressure  pattern,  or  its  most  significant 
features,  by  subjective  or  objective  methods  (Baur  1951). 
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The  isobaric  chart  may  also  be  regarded  in  a  kinematic  framework.  Two 
important  lines  of  this  work  are: 

(1)  the  classification  of  cyclone  and  anticyclone  paths,  and 

(2)  the  classification  of  circulation  or  airflow  patterns. 

Finally,  meteorological  and  climatological  elements  may  be  treated  directly. 
Three  possible  approaches  are: 

(1)  the  decomposition  of  selected  elements  into  specific  weather  types. 

(2)  the  identification  of  air  masses  on  the  basis  of  selected  parameters, 

and 

(3)  the  abstraction  of  combined  weather  factors  by  means  of  mathematical 

techniques  (averages,  means,  deviations,  etc.). 

Because  subjective  analyses  of  pressure  fields  pose  many  problems,  objective 
classification  techniques  should  be  used.  The  primary  methods  are  correlation 
techniques,  specification  techniques,  and  empirical  orthogonal  functions  (EOFs). 

Extrapolation  and  kinematic  techniques  also  are  used  in  forecasting.  These 
procedures  depend  on  the  assumption  that  salient  circulation  features  are  shifting  in 
a  quasi-regular  manner.  Experience  suggests  that  this  is  most  appropriate  when  upper 
air  charts  have  well-delineated  troughs  and  ridges  and  strong  tendency  fields.  The 
method  becomes  unreliable  when  flow  patterns  are  weak.  Also,  developing  circulation 
modes  cannot  be  predicted  by  extrapolation  procedures  (Wadsworth  et  al.  1948). 

Forecasts  are  generally  successful  when  dealing  with  time  scales  up  to  a  few 
days.  But  the  day-to-day  movement  of  low  and  high  pressure  centers  is  often 


roughly  the  same  order  of  magnitude  as  their  dimensions.  This  rapid  motion  of 


weather  systems,  coupled  with  erratic  path,  speed,  and  development,  points  to  the 
difficulty  of  forecasting  for  more  than  a  day  or  two  by  using  standard  forecast 
methods  of  interpolation  and  extrapolation  (Neiburger  et  al.  1982).  The  most 
successful  extended-range  forecasting  procedures  must  involve  the  current  state  of 
the  atmosphere  and  the  manner  and  rate  of  evolution.  Forecast  methods  which  do 
not  include  evolution  of  systems  usually  limit  success. 

The  main  purpose  of  utilizing  theoretical  distribution  models  is  to  describe 
the  observed  statistical  or  stochastic  properties  by  the  most  suitable  and  reasonable 
functions  for  each  climatic  element  (Namias  1953).  Different  characteristics  caused 
by  different  locations  and/or  different  time  periods  may  be  explained  by  discrepancies 
of  estimated  parameters  under  the  same  distribution  model.  Hence,  many  trial  and 
error  proposals  of  applying  different  types  of  distribution  models  to  specific  climatic 
elements  will  not  always  be  appropriate  in  comparing  or  interpreting  climatological 
data,  even  if  the  goodness  of  fit  test  shows  the  acceptance  of  the  hypothesis  in  each 
model  (Suzuki  1979). 

In  the  estimation  of  the  parameters  for  an  applied  distribution  model,  the  max¬ 
imum  likelihood  (ML)  method  should  be  emphasized  above  all  others.  A  problem 
arises  though,  the  ML  estimators  are  usually  biased.  Also,  the  asymptotic  sample 
variances  of  the  ML  estimators  are  not  always  sufficiently  clear  in  statistical  clima¬ 


tology. 
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Suzuki  believes  that  three  statistical  tests  are  required  in  order  to  model  climatic 
fluctuations  or  sequences,  namely,  the  significance  test  of  trend,  the  test  of  harmonic 
coefficients,  and  ergodicity  (the  test  of  whether  or  not  any  state  could  occur). 

In  forecasting  the  state  of  the  atmosphere  at  grid  points,  there  is  a  problem  of 
obtaining  vector- valued  estimates  of  meteorological  variables  at  a  grid  point  based 
on  multivariate  information  from  several  sources.  The  problem  is  to  find  the  “best" 
linear  combination  of  the  criterion  by  minimizing  the  variances  of  the  estimators. 

Several  examples  of  statistical  prediction  schemes  in  climatology  are  available. 
Most  predictions  are  based  on  a  number  of  predictor  variables,  for  example  wind 
speed,  temperature,  and  sea  level  pressure.  While  prediction  can  be  made  more 
accurate  by  bringing  in  as  many  relevant  predictor  variables  as  possible,  some  of 
them  may  be  highly  correlated  among  themselves  and  the  individual  contribution  of 
some  may  be  marginal.  The  problem  of  selecting  the  best  set  of  predictor  variables 
arises  in  various  situations.  Stringer  (1972)  cited  some  examples  from  the  literature 
regarding  prediction  and  visibility.  Martin  et  al.  ( 1963)  considered  an  example  dealing 
with  forecasting  of  the  24-hour  movement  and  change  of  central  pressures  of  North 
American  winter  anticyclones. 

Several  criteria  for  defining  the  best  set  of  predictor  variables  and  various 
techniques  for  selecting  the  best  set  have  been  discussed  in  a  paper  by  Hocking  ( 1976). 
Thompson  (1978)  also  prepared  a  review  article  and  evaluated  several  significant 
methods.  Martin  et  al.  (1963)  suggested  a  forward  type  stepwise  procedure,  whereas 
Lund  (1971)  used  a  blended  stagewise  and  stepwise  procedure.  These  papers  offer  a 
general  approach  to  variable  selection.  No  one  method  will  always  work  in  every 
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situation.  Also,  none  of  these  techniques  for  selecting  the  best  set  of  predictor 
variables  are  designed  to  produce  a  best  set  with  a  guaranteed  probability  (Gupta 
and  Panchapakesan  1979). 

Statistical  weather  forecasting  is  based  on  the  principle  of  making  predictive 
inferences  about  future  weather  from  statistics  of  past  weather.  A  meteorological 
element  is  to  be  forecasted  from  observations  of  the  same  or  other  meteorological 
elements  at  previous  times.  Usually,  the  mathematical  form  of  the  relationship  is  not 
known,  and  such  relationships  must  be  determined  empirically. 

Statistical  forecasting  methods  are  mainly  objective,  being  completely  objective 
as  far  as  the  forecaster  is  concerned.  Given  values  of  the  predictors,  the  best  forecast 
of  the  predictand  is  uniquely  determined  by  the  model.  However,  some  degree  of 
subjectivity  and  judgment  enters  the  formulation  of  the  procedure  used  in  the  forecast 
Statistical  weather  forecasts  will  be  nearly  objective  as  possible  without  knowledge  of 
the  exact  physical  relationships  between  predictors  and  predictand.  Also,  statistical 
methods  will  make  the  most  objective  use  of  past  weather  information.  Often  times, 
objectivity  in  the  design  of  forecast  procedure  can  be  increased  only  by  an  increase 
in  the  complexity  of  the  method. 

Formulation  of  statistical  forecast  methods  follow  three  steps: 

( 1 )  The  relation  of  the  predictand  to  any  number  of  predictors  must  be 
investigated,  and  the  predictors  most  advantageous  for  the  method  must 
be  selected. 

(2)  Convenient  rules,  graphs,  or  equations  must  be  developed.  These 


should  be  in  a  convenient  form  for  use  in  future  forecasts. 
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(3)  The  reliability  of  the  chosen  relation  must  be  tested  on  a  new  set  of 
observations  of  the  predictors  as  well  as  the  predictand. 

Predictors  should  be  selected  on  the  basis  of  physical  reasoning,  if  at  all  possible. 
For  example,  the  occurrence  of  rain  is  known  to  be  influenced  by  the  amount  of 
moisture  in  the  air  and  vertical  velocity.  The  vertical  motion  is  often  not  measured 
directly,  but  is  known  from  theory  to  be  correlated  with  the  north-south  wind 
component  or  with  thermal  advection  aloft.  Hence,  some  moisture  parameter  (e.g., 
humidity  or  dewpoint  temperature)  and  some  wind  parameter  {e.g.,  wind  direction 
at  850  mb  and  700  mb)  would  be  useful  for  precipitation  forecasting. 

Physical  reasoning  by  itself,  however,  is  not  sufficient  for  the  choice  of  predictors. 
There  are  several  weather  elements  that  indicate  moisture  in  the  atmosphere:  wet- 
bulb  depression,  relative  humidity,  or  dewpoint  depression.  Statistics  of  the  predictors 
and  the  predictand  observed  in  the  past  often  indicate  which  element  to  choose. 

Another  important  question  involves  the  number  of  predictors.  In  theory,  the 
more  predictors,  the  better  the  prediction.  In  practice,  one  is  always  limited  to  a 
definite  number  of  observations.  A  certain  portion  of  the  original  data  must  be  set 
aside  for  modifying  and  testing  the  model.  If  there  were  n  predictors  and  only  n 
observations  of  each,  n  linear  equations  which  would  exactly  fit  each  observation  of 
the  developmental  sample.  But  it  would  probably  not  fit  future  samples  as  well.  The 
reason  for  this  is  that  the  developmental  sample  was  fitted  too  well:  it  has  explained 
with  predictors  variations  in  the  predictand  caused  by  observational  error,  small-scale 
or  short-term  fluctuations,  or  by  elements  not  included  in  the  set  of  n  predictors. 
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In  general,  the  number  of  predictors  must,  be  smaller  than  the  number  of  usable 
observations  of  each  predictor.  Just  how  much  smaller  depends  on  the  complexity 
of  prediction  method.  If  a  linear  regression  equation  is  used,  many  predictors  can 
be  brought  in.  The  addition  of  cross-products,  quadratic,  or  higher  order  terms 
means  that  additional  coefficients  must  be  determined  from  the  same  data  set.  Since, 
effectively,  the  number  of  coefficients  well  determined  from  a  given  number  n  of 
observing  periods  is  constant,  this  means  that  the  number  of  predictors  has  to  be 
reduced  correspondingly. 

It  is  difficult  to  state  the  optimum  number  of  predictors,  given  n  sets  of 
observations.  The  number  depends  on  the  characteristics  of  the  variables  as  well 
as  on  the  kind  of  relationship  to  be  established.  Testing  on  independent  data  is  likely 
to  indicate  whether  too  many  predictors  were  used. 

In  most  statistical  forecast  techniques,  predictors  are  selected  at  the  same  point 
for  every  forecast.  These  places  may  be  observing  stations,  or  they  may  be  grid  points 
at  which  variables  are  known  through  some  type  of  analysis.  These  techniques  are 
called  fixed-point  techniques. 

Though  trajectory  techniques  may  be  physically  more  satisfactory,  the  main 
drawback  is  that  air  trajectories  must  first  be  determined  objectively.  Because 
construction  of  the  trajectories  is  tedious  and  not  terribly  accurate  if  procedures  are 
objective,  these  methods  have  been  used  only  rarely  in  statistical  weather  forecasting. 

There  are  several  methods  of  statistical  forecasting.  Groupings  might  be: 

(1)  linear  regression  techniques, 

(2)  successive  graphical  regression  techniques. 
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(3)  stratification  methods. 

(4)  residual  methods,  and 

(5)  combined  methods. 

Multiple  linear  regression  became  popular  in  the  early  1950 's  due  to  its  objec¬ 
tivity  coupled  with  the  advent  of  the  electronic  computer.  Accuracy  in  prediction  of 
numeric  weather  elements  (pressure  or  temperature,  for  example)  compares  favorably 
with  predictions  based  on  dynamic  considerations  alone  or  those  based  on  subjective 
synoptic  techniques  (Murphy  et  al.  1988). 

Statistical  regression  techniques  have  the  advantage  over  dynamic  methods  that 
no  physical  restrictions  have  to  be  made  before  the  techniques  can  be  applied,  but 
there  is  no  guarantee  derived  coefficients  will  be  stable.  In  theory,  statistically  derived 
coefficients  of  a  regression  equation  should  shed  some  light  on  the  physics  of  the 
atmosphere  processes  which  lead  to  movement  and  development  of  synoptic  patterns. 
Both  physical  and  statistical  methods  should  lead  to  the  same  set  of  forecasting 
equations. 

Graphical  regression  is  the  subjective  equivalent  of  non-linear  regression  if  the 
predictand  is  numerical  and  of  discriminant  analysis  if  qualitative.  Stratification 
methods  consist  of  selecting  class  intervals  for  a  number  of  predictors,  and  testing 
which  combination  of  groups  is  favorable  to  certain  properties  of  the  predictand. 
Residual  methods  consist  of  correlating  errors  of  forecasts  based  on  one  variable  with 
another;  hence,  the  second  variable  is  used  to  correct  forecasts  based  on  the  first.  The 
process  can  be  continued  so  that  a  third  variable  is  used  to  make  corrections  based 


on  the  first  two. 
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Combining  forecast  schemes  is  reported  to  provide  more  accurate  predictions 
than  individual  forecasts  alone.  Considerable  hindcast  skill  could  be  gained  when 
combing  two  predictive  schemes  in  an  optimal  (i.e.,  errcr-minimizing)  fashion.  This 
method  was  applied  to  long-range  forecasting  of  monthly  mean  tropical  Pacific  sea 
surface  temperatures  (Fraedrich  and  Smith  1989). 


CHAPTER  III 


PROCEDURE 

Building  a  temperature  forecast  model  requires  several  ingredients.  First, 
topography  must  be  considered.  Is  wind  direction  "locked  in"  a  particular  direction 
by  nearby  mountain  ranges?  Is  the  station  at  such  an  elevation  that  it  affects  its 
weather?  Second,  a  climatology  must  be  developed.  The  aim  is  for  useful  detail, 
but  limited  data  and  resources  may  negate  the  most  useful  climatology  for  an  area. 
Is  there  a  need  for  monthly,  weekly,  or  hourly  averages?  What  time  period  is  going 
to  be  most  useful  to  the  forecaster?  Whichever  format  one  chooses,  the  climatic 
tables  should  be  helpful  to  the  forecaster, i.e.,  referenced  often.  Climatic  tables  that 
sit  on  a  shelf  and  are  accessed  twice  a  year  are  not  particularly  useful.  Third,  which 
modeling  approach  works  best  for  the  weather  parameter  to  be  forecasted?  The 
model  needs  to  be  developed  from  available  data.  Once  these  aspects  of  the  model 
are  organized,  it  needs  to  be  tested.  This  chapter  will  go  through  the  process  of 
constructing  a  climatology  for  the  region  and  building  a  medium-range  temperature 


forecasting  model. 
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Figure  2. 


Map  of  the  Soviet  Union  with  the  study  area  outlined. 


Topography 

South-central  European  Russia  generally  suffers  from  some  degree  of 


moisture 


deficit  most  of  the  year  because  of  its  distance  from  the  Atlantic  Ocean  (Fig.  2). 
The  southern  two-thirds  of  the  region  is  made  up  of  the  Black  Sea  steppes  along  the 


plains.  Elevations  of  any  consequence  are  generally  between  200-1000  ni  with  the 
exception  of  the  northeast  portion  of  the  study  area  where  the  southern  extreme  of 
the  Ural  mountains  extend.  Elevations  here  are  in  excess  of  1000  m.  Areas  in  the 
east  and  south  tend  to  be  more  drought  prone. 

Qualitative  Climatology 

During  the  winter  temperature  and  pressure  gradients  are  at  a  maximum  so 
that  the  atmospheric  circulation  is  most  intense,  with  large  meridional  components 
that  bring  widely  varying  air  masses  into  close  juxtaposition.  Summer  weather,  on 
the  other  hand,  is  more  the  result  of  a  combination  of  advective  factors  and  on-site 
radiation  exchanges,  and  therefore  zonal  characteristics  of  the  earth's  surface  must 
be  kept  in  mind  when  discussing  summer  climatic  controls  (Lydolph  1977). 

General  atmospheric  circulation  patterns  over  the  Soviet  Union  are  controlled 
by  a  relatively  simple  pattern  of  pressure  centers  which  vary  drastically  from  winter 
to  summer.  In  winter,  mean  sea  level  pressure  patterns  reveal  build-up  of  an  intense 
high  pressure  system  over  the  Asian  land  mass.  The  center  generally  overlies  the 
Mongolian  Peoples'  Republic  with  a  strong  ridge  extending  westward  into  an  area 
north  of  the  Caspian  and  Black  Seas.  This  winter  protrusion  is  Voyeykov's  (1948) 
so-called  “great  axis"  which  divides  a  generally  northeasterly  flow  of  air  along  the 
northern  Black  Sea  coast  from  the  southwesterly  flow  of  air  across  much  of  European 
Russia.  When  it  is  well  formed,  this  great  axis  protrudes  all  the  way  to  the  western 
coast  of  Europe  and  effectively  blocks  penetration  of  air  and  cyclonic  systems  from 
the  Mediterranean  region.  It  often  acts  as  a  wedge  which  divides  routes  of  cyclonic 
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coast  of  Europe  and  effectively  blocks  penetration  of  air  and  cyclonic  systems  from 
the  Mediterranean  region.  It  often  acts  as  a  wedge  which  divides  routes  of  cyclonic 
storms  and  sends  them  either  northeastward  into  the  Ob  estuary  or  southeastward 
into  the  mountains  of  central  Asia. 

During  the  last  week  of  February,  negative  tendencies  in  atmospheric  pressure 
set  in  over  the  southern  Urals  and  spread  in  all  directions  eventually  covering  much 
of  European  Russia.  In  April,  a  major  change  takes  place  in  the  sea  level  pressure 
pattern.  The  Asiatic  high  weakens  and  shifts  its  center  westward  to  northeast 
Kazakhstan.  It  appears  that  what  is  now  taking  place  is  a  shift  from  the  dominance  in 
the  east  of  the  Asiatic  high  to  the  dominance  in  the  west  of  the  Azores  high  protruding 
across  Europe. 

Throughout  the  summer  a  weak  ridge  extends  eastward  from  the  Azores  high 
across  central  European  Russia  as  far  east  as  central  Kazakhstan.  In  European  Russia, 
as  the  western  nose  of  the  wintertime  Asiatic  high  is  replaced  by  the  eastern  nose  of  the 
summertime  Azores  high,  surface  winds  generally  shift  from  south  or  southwesterly 
to  west  or  northwesterly  (Lydolph  1977). 

Contour  patterns  at  700  mb  shift  from  a  January  situation,  which  shows  a  weak 
trough  over  western  European  Russia,  to  a  July  situation  which  shows  a  generally 
zonal  flow  across  the  country  as  a  cool  surface  high  over  the  Arctic  turns  to  a  weak  low 
at  the  700  mb  level.  Thermal  effects  on  the  circulation  pattern  accumulate  upward 
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until  a  weak  trough  at  500  mb  appears  over  the  Ob  estuary.  These  subtle  changes  in 
pressure  patterns  at  the  surface  and  aloft  produce  significant  changes  in  the  direction 
of  movement  of  cyclones  and  anticyclones  during  the  winter  and  summer  seasons. 
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Mid- August  marks  the  culmination  of  summer,  often  when  significant  pressure 
changes  begin  to  take  place  at  the  surface  in  European  Russia.  By  mid-September 
a  closed  high  has  formed  which  is  elongated  in  an  east-west  direction.  This  is  the 
beginning  of  the  shift  from  the  dominance  of  the  Azores  high  from  the  west.  October 
brings  a  definite  high  pressure  cell  centered  east  of  Lake  Baykal.  The  Asiatic  high 
remains  rather  consistent  in  this  general  location  until  mid-March. 

The  majority  of  fronts  and  cyclone  tracks  are  positioned  along  the  peripheries 
of  the  land  mass.  The  polar  front  generally  lies  south  of  the  USSR  with  a  western 
segment  in  the  Mediterranean- Asia  Minor-Middle  East  area.  This  western  segment 
spawns  many  cyclonic  storms  which  affect  the  Soviet  Union.  Those  forming  in  the 
eastern  Mediterranean  typically  move  northeastward  across  the  Black  Sea-Caucasian 
area.  Thus  they  pull  the  polar  front  far  northward  into  the  country. 

Fronts  which  affect  most  of  the  Soviet  Union  this  time  of  year  are  segments  of 
the  Arctic  front.  These  fronts  sweep  across  the  breadth  of  the  country,  particularly 
the  European  sector,  as  far  south  as  the  Black  Sea  and  Caspian  Sea. 

Many  of  the  cyclonic  storms  which  affect  the  Soviet  Union  in  winter  originate 
either  in  the  Icelandic  low  or  in  the  Mediterranean  and  are  shunted  along  northern 
and  southern  borders  of  European  Russia  by  the  western  protrusion  of  the  Asiatic 
high. 

Quantitative  Climatology 

Appendices  B-J,  pages  54-135,  show  the  general  climatology  for  south-central 
European  Russia.  Weather  elements  are  summarized  by  month  and  time  (morning 
and  afternoon).  Notable  trends  in  data  are: 
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1.  Greatest  variability  during  winter  for  temperature,  visibility,  SLP,  and 
ceiling  height. 

2.  Largest  variation  in  temperature  for  winter  months,  late  night  and 
early  morning  hours. 

3.  Greatest  variation  in  temperature  for  afternoon  hours  during  summer 
months. 

One  station,  Astrakhan,  was  chosen  for  a  detailed  climatology  (see  Appendix 
B,  pages  54-111)  consisting  of  monthly,  pentad,  and  3-hourly  averages.  This  type  of 
detail  can  give  a  forecaster  a  better  feel  for  diurnal  and  seasonal  trends  in  the  various 
weather  elements  until  he  acquires  practical  forecast  experience.  Noticable  trends  are 
annotated  on  individual  tables  in  appendix  B. 

Model  Construction 

The  review  section  presented  several  modeling  approaches.  The  methods  vary 
widely  in  terms  of  forecasting  accuracy,  constraints  they  impose  on  the  user,  and 
their  complexity  and  intuitive  appeal  to  the  user.  Given  the  wide  choice  of  alternative 
forecasting  methods  available,  it  is  particularly  useful  for  the  forecaster  to  have  criteria 
that  can  be  used  in  selecting  and  comparing  alternative  methodologies. 

Effective  forecasting  methodology  for  a  given  situation  has  two  prerequisites. 
First,  a  range  of  alternatives  must  be  understood  and  recognized.  Second,  some 
systematic  procedure  for  comparing  strengths  and  weaknesses  of  alternative  methods 
in  different  situations  should  be  used.  The  following  sections  will  provide  such  a 


framework. 
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Criteria  for  comparing  and  selecting  a  forecasting  method  can  be  organized  in 
several  ways.  The  organization  used  here  establishes  a  set  of  priorities  for  the  criteria 
so  that  those  most  important  can  be  considered  first.  Under  any  system,  the  accuracy 
of  a  forecast  is  given  top  priority. 

Unfortunately,  forecasters  generally  terminate  the  evaluation  process  after  this 
first  criterion  of  accuracy  has  been  examined.  A  number  of  other  criteria  are  also  very 
important  as  a  part  of  this  evaluation  process.  These  criteria  include:  data  pattern 
to  be  forecast,  forecasting  time  horizon  to  be  covered,  cost  of  applying  alternative 
methodologies,  and  ease  of  application  (Makridakis  and  Wheelwright  1978). 

Data  pattern  is  important  in  selecting  a  forecasting  method  because  different 
methods  can  cope  with  only  certain  kinds  of  data  patterns.  Some  methods  can 
cope  with  a  very  wide  range  of  patterns,  but  these  are  usually  difficult  to  build  and 
difficult  to  apply:  thus  a  trade-off  is  involved  using  them.  Generally,  the  pattern 
element  can  be  divided  into  two  main  subparts:  patterns  that  repeat  in  time  (trend 
and  seasonal  components)  and  patterns  that  are  aperiodic  and  do  not  repeat  over 
fixed  time  intervals. 

The  time  horizon  criterion  for  evaluating  forecasting  methods  is  closely  related 
to  the  pattern  criterion.  Time  horizon  refers  to  the  time  length  of  forecast  (i.e..  2 
days,  5  days.  etc.).  Closely  related  to  time  horizon  to  be  forecast  is  the  time  required 
to  actually  prepare  the  forecast.  If  one  has  6  hours  to  prepare  a  forecast  the  approach 
will  be  different  than  if  the  preparation  time  is  only  30  minutes. 

Cost  elements  sometime  must  be  used  as  a  trade-off  against  such  things  as 
accuracy,  ease  of  application,  and  pattern.  Data  requirements,  computer  time  and 
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human  resources,  training  forecasters  in  use  of  new  forecasting  methodology,  and  use 
of  the  model  on  an  ongoing  basis  needs  to  be  considered  under  the  cost  element. 

Ease  of  application  might  be  thought  of  as  a  criterion  that  brings  together 
other  considerations  not  covered  by  the  previous  criteria.  Complexity,  timeliness  that 
forecasts  provide,  the  level  of  knowledge  required  for  application,  and  the  conceptual 
basis  and  ease  with  which  it  can  be  conveyed  to  the  final  user  of  the  forecast  are  all 
important  considerations.  A  common  stumbling  block  in  adoption  of  an  appropriate 
forecasting  technique  is  making  users  comfortable  with  the  technique  and  its  rationale 
so  that  they  can  effectively  judge  the  results  of  the  forecasting  method  and  its 
usefulness  (Panofsky  and  Brier  1976). 

After  comparing  models  in  the  review  section  and  considering  the  above  criteria 
for  a  suitable  model,  the  linear  regression  approach  was  chosen.  Ease  of  application 
to  this  particular  data  set  was  a  major  reason  in  choosing  the  regression  method. 
Since  the  data  set  only  contains  surface  synoptic  data,  other  more  elaborate  schemes 
were  abandoned. 


Developing  the  Regression  Equation 

Multiple  regression  is  defined  as  the  amount  of  change  in  a  dependent  variable 
associated  with  a  unit  change  in  independent  variables.  The  equation  for  multiple 
regression  analysis  is: 

V  =  ao  -  a  1-r  1  —  02^2  —  •••  -  anxn  (1) 


where  n  is  the  number  of  independent  variables,  and  an  is  the  coefficient  of  regression. 
In  general,  an  represents  an  expected  change  in  y  for  a  unit  change  in  xn.  holding  all 
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variable  or  variables  in  multiple  regression  analysis.  A  correlation  between  two  or 
more  variables  describes  the  strength  of  the  linear  relationship  between  them.  Large 
correlations  infer  a  strong  relationship  while  small  correlations  do  not  necessarily 
infer  any  relationship.  One  would  have  to  determine  whether  the  correlation  was 
due  to  sampling  errors.  All  atmospheric  elements  are  interrelated  to  some  extent 
even  though  the  physical  relationship  may  not  be  known  at  this  time.  Care  must 
be  exercised  when  forcing  statistics  on  innocent  data  because  relationships  can  be 
inferred  when  no  actual  cause  and  effect  interrelation  necessarily  exists. 

The  MAXR  method  of  stepwise  regression  analysis  was  implemented  by  SAS. 
The  procedure  begins  by  finding  a  one  variable  model  which  produces  the  highest 
R2  (proportion  of  the  variation  among  the  observed  values  that  is  attributable  to  the 
fitted  regression).  Another  variable  is  added  which  produces  the  greatest  increase  in 
R2.  Once  a  preliminary  two-variable  model  is  chosen,  each  of  the  variables  in  the 
model  is  compared  to  each  variable  not  in  the  model.  MAXR  determines  if  removing 
one  variable  would  increase  R2  After  comparing  all  possible  switches,  the  one  that 
produces  the  largest  R2  is  made.  The  process  continues  until  MAXR  finds  that  no 
switch  increases  R2 .  The  resulting  two- variable  model  is  the  best  two- variable  model 
the  technique  can  find  Another  variable  is  added  to  the  model  and  the  process 
continues  (Freund  and  Littell  1986). 

Should  cross-product  or  higher  ordered  terms  be  included  in  the  regressions'7 
A  procedure  that  calculates  all  possible  cross-product  and  quadratic  terms  for  the 
independent  variables  in  the  model  was  implemented  by  SAS  (RSREG).  An  F  statistic 
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is  given  to  indicate  significance  of  each  particular  term  along  with  an  overall  R2  value 
for  the  model.  The  findings  were: 

(1)  Quadratic  terms  and  cross-product  terms  did  not  significantly  increase  the 
overall  R 2  vaiue  for  the  model  during  winter. 

(2)  Quadratic  terms  generally  not  significant  for  summer. 

(3)  Cross-product  terms  significantly  increase  R2  value  during  the  summer  over 
most  every  station.  In  some  instances  the  amount  of  variation  explained  by 
cross-product  terms  more  than  doubled  the  overall  R2  values. 

Testing  the  Model 

Skill  of  a  forecast  should  be  defined  in  a  strict  statistical  sense,  that  is,  forecasts 
must  show  a  higher  proportion  of  successes  than  climatic  probability  of  occurrence 
(Allen  and  Vernon  1951).  The  skill  of  forecast  will  be  defined  simply  by  the  percent 
correct  or  total  number  of  correct  forecasts  divided  by  the  total  number  of  forecasts 
issued  (Brier  and  Allen  1951).  A  correct  forecast  will  be  defined  within  a  specific 
tolerance.  If  the  forecasted  temperature  ;s  within  2°C  of  the  actual  temperature  then 
the  forecast  will  be  judged  correct.  The  score  must  be  evaluated  with  respect  to 
several  forecast  methods.  Methods  compared  will  be: 

(1)  climatology  (defined  as  the  monthly  average  temperature  for  morning  and 
afternoon,  respectively), 

(2)  persistence  (defined  as  the  present  morning  or  afternoon  temperature  carried 
throughout  the  forecast  period). 

(3)  climatology  plus  persistence  divided  by  2, 
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(4)  regression,  and 

(5)  regression  combined  with  persistence  and  combined  with  climatology. 
Coefficients  of  the  regression  equation  will  be  developed  from  climatological 

data  and  use  the  current  observation  as  input.  Is  73%  correct  forecasts  significantly 
different  than  a  68%  correct  forecast  skill?  This  question  must  be  answered  when 
comparing  different  forecast  schemes  (i.c..  climatology,  persistence,  and  regression) 
to  determine  which  method  is  in  fact  “better”  than  the  others.  Procedure: 

(1)  Use  the  \2  test  with  1  degree  of  freedom. 

(2)  The  percent  correct  value  must  be  transformed  into  a  numeric  value  (i.e.. 
the  number  of  correct  forecasts). 

(3)  Take,  for  example,  113  correct  forecasts  and  70  correct  forecasts.  Are  they 
significantly  different?  Compute  the  test  statistic: 


2  (H3  -  70)2 

X  =  - To -  =  2« 


(4)  \2  must  be  larger  than  3.84  at  a  =  0.05  level  of  significance  to  be  considered 
significantly  different.  Because  26  is  greater  than  3.84.  113  correct  forecasts  are 
significantly  “better”  than  70  correct  forecasts. 

1970  was  randomly  selected  as  the  test  year.  This  year  was  deleted  from  the 
original  data  set  and  used  as  a  verification  period. 

First,  temperature  was  forecasted  for  up  to  5  days.  Dewpoint  temperature 
was  selected  as  an  additional  weather  element  to  test  the  methodology  of  forecasting 
using  a  linear  regression  approach.  Only  winter  morning  dewpoint  temperatures  will 


be  forecast. 


CHAPTER  IV 


RESULTS 


Temperature 

1.  The  final  forecasting  model  consists  of  4  sets  of  5  equations  (total  of  20 
equations)  for  summer  morning  and  afternoon,  and  winter  morning  and  afternoon 
(Table  1).  Systematic  error  in  particular  lags  (mostly  lag  2.  but  not  always)  was 
caused  by  the  interpretation  of  missing  values  by  SAS  (Fig.  3  and  Fig.  4).  Eliminating 
missing  values  completely,  plus  deleting  extreme  values  of  temperature,  dewpoint 
temperature,  and  sea  level  pressure  appears  to  have  abated  the  problem.  It  appears 
coefficients  for  a  particular  season  and  time  are  specific  to  that  domain,  id  est.  while 
spatially  averaged  regression  coefficients  produce  acceptable  forecasts  of  temperature, 
using  averages  for  the  day  [i.e.,  morning  and  afternoon)  or  averages  for  the  year  {i.e., 
summer  and  winter)  do  not  produce  satisfactory  forecasts. 

a)  Persistence  showed  a  higher  percentage  of  correct  forecasts  than  climatology 
during  winter  months. 

b)  Climatology  showed  a  higher  percentage  of  correct  forecasts  than  persistence 
during  summer  months. 

c)  The  average  of  climatology  and  persistence  (climatology  plus  persistence 
divided  by  two)  always  showed  a  higher  percentage  of  correct  forecasts  than 
persistence  alone  for  both  the  summer  and  winter  seasons. 


Table  1.  Regression  coefficients  from  5-dav  temperature  forecast  model  averaged 
over  9  stations. 


SUMMER 

AM 


TEMP  DEWPT 


UAD 


SLP 


DAY  +  1 

0.184 

0.153 

-0.170 

-0.027 

DAY +  2 

0.408 

0.C71 

-0.044 

-0.018 

DAY +  3 

0.244 

0.079 

-0.162 

-0.022 

DAY +  4 

0.495 

-0.017 

-0.163 

-0.010 

DAY +  5 

0.132 

0.126 

-0.125 

-0.013 

SUMMER 

PM 


TEMP  DEWPT 


UAD 


SLP 


DAY  +  1 

0.531 

0.156 

-0.199 

0.146 

DAY +  2 

0.889 

-0.053 

-0.157 

0.208 

DAY +  3 

0.273 

0.234 

-0.213 

0.234 

DAY +  4 

0.633 

0.075 

-0.105 

0.189 

DAY +  5 

0.392 

0.158 

-0.211 

0.248 

WINTER 

AM 


TEMP  DEWPT 


UAD 


SLP 


DAY  +  1 

0.592 

0.212 

0.064 

0.032 

DAY +  2 

0.486 

0.237 

0.113 

0.056 

DAY +  3 

0.252 

0.406 

0.199 

0.048 

DAY +  4 

0.594 

0.125 

0.291 

0.052 

DAY +  5 

0.436 

0.146 

0.229 

0.073 

WINTER 

PM 


TEMP  DEWPT 


UAD 


SLP 


DAY  +  1 

0.463 

0.284 

0.088 

0.042 

DAY +  2 

0.370 

0.298 

0.128 

0.069 

DAY +  3 

0.253 

0.364 

0.201 

0.064 

DAY +  4 

0.619 

0.130 

0.242 

0.083 

DAY +  5 

0.296 

0.258 

0.237 

0.100 

27 


— o— 


LAG  1 
LAG  2 
LAG  3 


Figure  3.  Temperature  residuals  for  5-day  temperature  forecasts  with  systematic 
error. 


d)  The  average  of  climatology  and  persistence  showed  a  higher  percentage  of 
correct  forecasts  than  climatology  alone  during  winter  months  (Table  2). 

e)  Climatology  alone  showed  a  higher  percentage  correct  forecasts  during  the 
summer  than  the  average  of  climatology  and  persistence  (Table  3). 

f)  Climatology  alone  showed  a  higher  percentage  of  correct  forecasts  in  July 
mornings  than  any  other  forecast  scheme. 
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Figure 


DAY 

4.  Temperature  residuals  for  5-day  temperature  forecasts  with  systematic 
error  removed. 


2.  Because  the  4  variable  prediction  model  for  high  and  low  temperature  showed 
higher  skill  (percent  correct)  in  forecasting  these  temperatures  than  persistence  or 
climatology,  it  was  chosen  to  forecast  temperatures  1-5  days  into  the  future  (Table  4). 
a)  Fig.  5  indicates  the  regression  equation  had  a  higher  skill  than  the  average 
of  persistence  and  climatology  in  all  but  three  cases  (total  of  108  cases). 


Table  2.  A  percent  correct  (T)  forecast  comparison  using  persistence  (P).  clima¬ 
tology  ( Claud  the  average  regression  equation  (R).in  5-day  temperature 
forecasts,  by  station,  lags  1-5  (test  year  1970). 


STATION  1:  ASTRAKHAN,  RS 


WINTER  SUMMER 

MORNING _ AFTERNOON _ MORNING  AFTERNOON 


FORECAST 

METHOD 

DEC 

JAN 

AM 

FEB 

AM 

DEC 

PM 

JAN 

PM 

FEB 

PM 

JUN 

AM 

JUL 

AM 

AUG 

AM 

JUN 

PM 

JUL 

PM 

AUG 

PM 

P 

36 

30 

38 

38 

35 

34 

1  8 

22 

25 

23 

22 

20 

c 

38 

42 

1  3 

35 

56 

14 

56 

73 

76 

52 

35 

57 

(P  +  C)/2 

55 

51 

12 

71 

60 

59 

45 

45 

40 

28 

32 

29 

R 

7  1 

82 

61 

77 

77 

65 

86 

65 

89 

_J8 

80 

52 

TOTAL 

FORECASTS 

130 

130 

115 

130 

125 

110 

125 

130 

125 

125 

130 

130 

STATION  2:  ORENBURG,  RA 


WINTER  SUMMER 

MORNING _ AFTERNOON _ MORNING  AFTERNOON 


FORECAST 

METHOD 

DEC 

AM 

JAN 

AM 

FEB 

AM 

rax: 

PM 

JAN 

PM 

FEB 

PM 

JUN 

AM 

JUL 

AM 

AUG 

AM 

JUN 

PM 

JUL 

PM 

AUG 

PM 

P 

22 

30 

19 

25 

23 

23 

1  8 

20 

24 

18 

16 

20 

c 

12 

9 

26 

31 

1  5 

0 

36 

50 

54 

20 

54 

27 

(P+C)/2 

4  6 

29 

24 

48 

22 

31 

42 

35 

37 

22 

32 

26 

R 

48 

68 

50 

50 

54 

39 

71 

8  1 

73 

41 

8  1 

70 

TOTAL 

FORECASTS 

130 

130 

115 

130 

125 

110 

125 

130 

125 

125 

130 

130 

STATION  3:  KALMIKOVO,  RA 


WINTER  SUMMER 

MORNING _ AFTERNOON _ MORNING _ AFTERNOON 


FORECAST 

METHOD 

DEC 

AM 

JAN 

AM 

FEB 

AM 

DEC 

PM 

JAN 

PM 

FEB 

PM 

JUN 

AM 

JUL 

AM 

AUG 

AM 

JUN 

PM 

JUL 

PM 

AUG 

PM 

P 

21 

23 

28 

3  1 

20 

26 

2  1 

20 

23 

18 

22 

18 

c 

27 

1  5 

0 

23 

19 

13 

68 

65 

65 

44 

38 

3  1 

(P  +  C)/2 

45 

56 

1  8 

52 

29 

27 

44 

39 

41 

22 

32 

25 

R 

62 

78 

60 

60 

37 

63 

63 

38 

57 

53 

71 

62 

TOTAL 

FORECASTS 

130 

130 

115 

130 

125 

110 

125 

130 

125 

125 

130 

130 

30 


Table  2. 


Continued 


STATION  4:  GURYEV,  RA 


WINTER  SUMMER 

MORNING  AFTERNOON _ MORNING _ AFTERNOON 


FORECAST 

METHOD 

DEC 

AM 

JAN 

AM 

FEB 

AM 

DEC 

PM 

JAN 

PM 

FEB 

PM 

JON 

AM 

JUL 

AM 

AUG 

AM 

JUN 

PM 

JUL 

PM 

AUG 

PM 

P 

24 

25 

32 

25 

30 

34 

1  9 

20 

22 

18 

22 

24 

c 

27 

31 

1  3 

31 

35 

64 

48 

62 

60 

48 

50 

27 

(P  +  C)/2 

42 

63 

1  6 

60 

36 

36 

46 

37 

38 

26 

32 

28 

R 

67 

82 

57 

72 

48 

60 

80 

57 

77 

52 

78 

6  1 

TOTAL 

FORECASTS 

130 

130 

115 

130 

125 

110 

125 

130 

125 

125 

130 

130 

STATION  5:  RYAZAN,  RS 


WINTER  SUMMER 

MORNING _ AFTERNOON _ MORNING _ AFTERNOON 


FORECAST 

METHOD 

DEC 

AM 

JAN 

AM 

FEB 

AM 

DEC 

PM 

JAN 

PM 

FEB 

PM 

JUN 

AM 

JUL 

AM 

AUG 

AM 

JUN 

PM 

JUL 

PM 

AUG 

PM 

P 

40 

38 

32 

44 

50 

29 

27 

24 

28 

22 

18 

29 

c 

19 

23 

9 

3  1 

1  9 

30 

52 

46 

54 

44 

3  1 

38 

(P+C)/2 

44 

50 

29 

59 

48 

45 

57 

35 

42 

26 

35 

37 

R 

92 

92 

77 

82 

84 

75 

87 

44 

79 

51 

75 

78 

TOTAL 

FORECASTS 

130 

130 

115 

130 

125 

110 

125 

130 

125 

125 

130 

130 

STATION  6:  PENZA,  RS 


WINTER  SUMMER 

MORNING _ AFTERNOON _ MORNING _ AFTERNOON 


FORECAST 

METHOD 

DEC 

AM 

JAN 

AM 

FEB 

AM 

DEC 

PM 

JAN 

PM 

FEB 

PM 

JUN 

AM 

JUL 

AM 

AUG 

AM 

JUN 

PM 

JUL 

PM 

AUG 

PM 

P 

40 

30 

25 

5  1 

33 

28 

22 

20 

24 

23 

20 

32 

c 

35 

19 

4 

38 

27 

1  3 

24 

46 

50 

20 

54 

19 

(P+C)/2 

48 

40 

28 

54 

53 

31 

42 

35 

48 

24 

30 

25 

R 

4  1 

63 

40 

71 

64 

63 

69 

62 

75 

38 

68 

53 

TOTAL 

FORECASTS 

130 

130 

115 

130 

125 

110 

125 

130 

125 

125 

130 

130 

Table  2.  Continued 


STATION  7:  KUYBYSHEV,  RA 


WINTER  SUMMER 

MORNING _ AFTERNOON _ MORNING _ AFTERNOON 


FORECAST 

METHOD 

DEC 

AM 

JAN 

AM 

FEB 

AM 

DEC 

PM 

JAN 

PM 

FEB 

PM 

JUN 

AM 

JUL 

AM 

AUG 

AM 

JUN 

PM 

JUL 

PM 

AUG 

PM 

P 

33 

23 

1  8 

37 

22 

27 

2  6 

18 

22 

23 

22 

26 

c 

1  2 

19 

1  3 

19 

42 

35 

32 

65 

38 

24 

35 

42 

(P  +  C)/2 

22 

56 

56 

25 

53 

50 

46 

36 

49 

1  6 

32 

28 

R 

28 

62 

63 

48 

53 

62 

7  1 

62 

76 

56 

63 

51 

TOTAL 

FORECASTS 

130 

130 

115 

130 

125 

110 

125 

130 

125 

125 

130 

130 

STATION  8:  VOLGOGRAD,  RS 


WINTER  SUMMER 

MORNING  AFTERNOON _ MORNING _ AFTERNOON 


FORECAST 

METHOD 

DEC 

AM 

JAN 

AM 

FEB 

AM 

DEC 

PM 

JAN 

PM 

FEB 

PM 

JUN 

AM 

JUL 

AM 

AUG 

AM 

JUN 

PM 

JUL 

PM 

AUG 

PM 

P 

38 

39 

34 

44 

43 

42 

18 

21 

28 

21 

22 

18 

c 

23 

12 

5 

27 

27 

14 

44 

46 

58 

24 

50 

46 

(P  +  C)/2 

46 

61 

15 

61 

51 

25 

41 

36 

49 

28 

3  1 

30 

R 

75 

83 

59 

78 

62 

67 

57 

55 

86 

49 

80 

68 

TOTAL 

FORECASTS 

130 

130 

115 

130 

125 

110 

125 

130 

125 

125 

130 

130 

STATION  9:  ROSTOV-ON-DON,  RS 


WINTER  SUMMER 

MORNING _ AFTERNOON _ MORNING _ AFTERNOON 


FORECAST 

METHOD 

DEC 

AM 

JAN 

AM 

FEB 

AM 

DEC 

PM 

JAN 

PM 

FEB 

JUN 

AM 

JUL 

AM 

AUG 

AM 

JUN 

PM 

JUL 

PM 

AUG 

PM 

P 

35 

26 

18 

42 

29 

21 

20 

23 

22 

23 

24 

29 

c 

15 

8 

26 

12 

19 

22 

52 

54 

27 

20 

54 

1  9 

(P  +  C)/2 

42 

27 

40 

44 

27 

49 

35 

36 

25 

31 

42 

49 

R 

64 

79 

73 

78 

72 

76 

75 

59 

68 

50 

75 

57 

TOTAL 

FORECASTS 

130 

130 

115 

130 

125 

110 

125 

130 

125 

125 

130 

130 

32 


Table  3.  A  percent  correct  (%)  forecast  comparison  using  persistence  (P).  clima¬ 
tology  (Cl.  and  the  average  regression  equation  (R)  in  5-day  temperature 
forecasts,  averaged  over  all  stations  (test  year  1970). 


WINTER  SUMMER 

MORNING _ AFTERNOON _ MORNING  AFTERNOON 


FORECAST 

METHOD 

DEC 

AM 

JAN 

AM 

FEB 

AM 

DEC 

PM 

irri 

FEB 

PM 

JIJN 

AM 

JUL 

AM 

AUG 

AM 

JUN 

PM 

JUL 

PM 

AUG 

PM 

P 

32 

29 

TO 

TOiri 

TOW 

29 

2  1 

21 

24 

2  1 

21 

WEL 1 

c 

23 

20 

12 

TO 

29 

23 

kb 

56 

54 

33 

KB 

KB 

nggia 

kb 

48 

26 

53 

KB 

39 

L44 

TOiri 

41 

25 

33 

3  1 

R 

61 

Ml 

60 

68 

6  1 

63 

HU 

53 

WBR 

58 

mm 

6  1 

TOTAL 

FORECASTS 

1170 

1170 

1035 

1170 

1125 

990 

1125 

1170 

1125 

1125 

1170 

1170 

I 


) 


> 


> 
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3.  A  2-day  wave  in  the  temperature  field  was  discovered.  Though  present  most 
of  the  year,  its  strongest  signal  occurred  in  summer  morning  (Fig.  6  and  Fig.  7)  . 

a)  R 2  values  for  winter  were  significantly  higher  than  R 2  values  for  summer. 

b)  Afternoon  R 2  values  were  higher  than  morning  R2  values  for  both  summer 
and  winter. 

c)  The  slope  of  the  line  for  R2  vs  lag  was  typically  -0.6  for  summer  and  -1.0  for 
winter  forecasts.  A  steeper  slope  would  indicate  a  more  rapid  decay  of  forecast 
skill  as  time  progressed. 

d)  Forecasts  combined  for  lags  1-5  were  broken  down  into  respective  lags. 
Percent  correct  forecasts  were:  71.  98.  61.  65.  and  10%,  respectively. 


I 
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Table  4.  A  percent  correct  (%)  forecast  comparison  using  a)  station  specific  regres¬ 
sion  coefficients  and  b)  averaged  regression  coefficients  over  9  stations  in 
the  5-day  temperature  forecast  model,  lags  1-5  (test  year  1970). 


a) 


STATION  NUMBER 

SEASON 

AND 

TIME 

1 

2 

3 

4 

5 

6 

7 

8 

9 

PERCEN1 

r  CORRECT (%)  ! 

WAM 

74 

65 

57 

59 

87 

69 

7  1 

83 

72 

WPM 

78 

54 

63 

59 

80 

66 

64 

88 

75 

SAM 

80 

77 

67 

7  1 

70 

59 

78 

78 

67 

SPM 

63 

74 

62 

64 

68 

63 

67 

68 

66 

b) 


STATN 

3N  NUMBER  ! 

SEASON 

AND 

TIME 

1 

2 

3 

4 

5 

6 

7 

8 

9 

PERCEN1 

r  CORRECT 

%)  i 

WAM 

72 

56 

48 

51 

73 

67 

69 

83 

72 

WPM 

73 

48 

60 

54 

69 

53 

60 

8  1 

75 

SAM 

78 

75 

67 

70 

66 

52 

71 

70 

63 

SPM 

63 

64 

49 

57 

66 

62 

64 

68 

6  1 

WAM  =  WINTER  MORNING:  Dec,  Jan,  Feb;  (04+07)LST/2 
WPM  =  WINTER  AFTERNOON:  (13+16)LST/2 
SAM  =  SUMMER  MORNING:  Jun,  Jul,  Aug 
SPM  =  SUMMER  AFTERNOON 

%  CORRECT  =  Forecast  temperature  within  ±  2K  of  actual 
temperature. 

Total  number  of  forecasts  for  each  time  period:  WAM  =  375, 
WPM  =365,  SAM  =  380,  SPM  =  385.  Based  on  the  sum  of  lags  1-5. 
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R  -  (C-P)/2  (%) 


Figure  5.  Frequency  of  the  difference  in  average  of  persistence  (P)  and  climatol¬ 
ogy  (C)  forecast  from  the  regression  forecast(R)  in  percent  correct  (%) 
forecasts  (108  cases). 

Dewpoint  Temperature 

1.  A  3  variable  model  was  selected  to  forecast  dewpoint  temperature  (Table  5). 
The  final  forecast  equation  consisted  of  temperature,  dewpoint  temperature,  and 
amount  of  sky  cover.  The  ceiling  element  (low,  middle,  and  high)  was  statistically 
significant  only  half  the  time.  Coupled  with  the  fact  that  R 2  only  increased  a  percent 
or  two,  this  weather  element  was  not  included  in  the  final  model. 


SUMMER  am 


SUMMER  pm 


Figure  6. 

Summertime  R 2  values  vs  lag 

r- 

3 

(00 

WINTER  am 

/ 

s 

(001 

WINTER  pm 

Wintertime  R 2  values  vs  lag 
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Table  5.  Regression  coefficients  from  5-day  dewpoint  temperature  forecast  model 
averaged  over  9  stations. 


WINTER 

AM 

TEMP 

DEWPT 

SKY 

COVER 

DAY  +  1 

0.382 

0.601 

0.013 

DAY +  2 

0.375 

0.540 

0.037 

DAY +  3 

0.368 

0.419 

-0.008 

DAY +  4 

0.372 

0.398 

-0.010 

DAY +  5 

0.338 

0.402 

-0.011 

a)  Forecast  dewpoint  temperatures  appear  2°C’  too  cold  through  all  lags  and 
stations.  This  would  appear  to  indicate  that  the  test  year  was  drier  and/or 
cooler  than  the  average  for  the  period  of  record. 

b)  As  in  the  temperature  forecast  model,  average  values  for  dewpoint  temper¬ 
ature  regression  coefficients  produced  satisfactory  forecasts  (Table  6). 

c)  Table  7  shows  the  regression  forecast  once  again  holds  the  best  forecast  skill. 

d)  Table  8  (values  averaged  over  all  stations)  shows  persistence  performs  better 
than  climatology  for  winter  morning  dewpoint  temperature  forecasts  while 
combining  persistence  and  climatology  shows  a  marked  improvement  in  forecast 


skill. 
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Table  6.  A  percent  correct  ( % )  forecast  comparison  using  station  specific  regression 
coefficients  and  averaged  regression  coefficients  over  9  stations  in  the  5-day 
dewpoint  temperature  forecast  model,  lags  1-5,  winter  morning  (test  year 
1970). 


STATION  NUMBER  1 

TYPE 

1 

2 

3 

4  5 

6 

7 

8 

9 

PERCENT  CORRECT  (%)  I 

SPECIFIC 

i!i 

89 

89 

mzm 

93 

93 

85 

92 

98 

AVERAGE 

87 

82 

87 

80 

79 

88 

83 

85 

89 

WAM  =  WINTER  MORNING:  Dec,  Jan,  Feb;  (04+07)LST/2 
%  CORRECT  =  Forecast  temperature  within  ±  2K  of  actual 
temperature. 

Total  number  of  forecasts  for  each  time  period:  WAM  =  375. 
Based  on  the  sum  of  lags  1-5. 


A  percent  correct  (°c)  forecast  comparison  using  persistence  (P).  climatol- 
°§y  (O-  an^  regression  equation  (R)  in  5-day  dewpoint  temperature 
forecasts  at  9  specific  stations,  lags  1-5  (test  vear  1970). 


4  1 

KZ3 

58 

9 

88 

68 

35 

85 

90 

82 

n 

130 

115 

STATION:  1 


FORECAST  DEC  JAN  FEB 
METHOD  JAM  AM  AM 


P  +  O/2 


TOTAL 

FORECASTS 


STATION:  2 


FORECAST  I  DEC  JAN  FEB 
METHOD  lAM  AM  AM 


29 


3  1  8 


P+O/2  I  74  581  35 


88  8  1 


TOTAL 

P0RBCASTS  130  130  115 


1^11 


STATION:  3 


FORECAST  I  DEC 
METHOD  IaM 


FEB 

AM 


27 


381  10  26 


70  32 


89  88  82 


TOTAL 

P0RECASTS  130  130  115 


STATION:  4 


FORECAST  (DEC  JAN  FEB 
METHOD  lAM  AM  AM 


KB 

30 

40 

38 

1  2 

4 

62 

82 

26 

77 

83 

80 

130 

130 

115 

P+C)/2 


TOTAL 

FORECASTS 


STATION:  5 


FORECAST  I  DEC  JAN  FEB 
METHOD  lAMjAM  AM 


33 


C  1  46  231  17 


80 


STATION:  6 


FORECAST  DEC  JAN  FEB 
METHOD  IaM  AM  AM 


28  24 


C _  42  271  9 


P+O/2  I  631  45  3  1 


88  91  86 


TOTAL 

P0RECASTS  130  130  1  15 


STATION:  7 


FORECAST  I  DEC  JAN  FEB 
METHOD  IAM  AM  AM 


31  22 


C  16  27  35 


P+O/2  t  29  69  5  1 


78  87  83 


TOTAL  T 

FORECASTS  130  130  1  15 


STATION:  8 


FORECAST 
Nm  rflOD 


JAN  FEB 
AM  lAM  AM 


TOTAL 


STATION:  9 


FORECAST 

METHOD 


KB 

KB 

KB 

35 

31 

9 

87 

59 

27 

87 

90 

78 

130 

130 

115 

ITT* 


JAN  FEB 
AM  lAM  AM 


R2SQ] 


TOTAL 


341 

18 

18 

12 

151 

30 

55 

KB 

62 

KB 

90 

89 

130 

130 

115 

TOTAL 

P0RBCASTS  130  130  1  15 
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Table  8.  A  percent  correct  (T)  forecast  comparison  using  persistence  iP).  climatol¬ 
ogy  (C).  and  the  regression  equation  (R)  in  5-day  dewpoint  temperature 
forecasts  averaged  over  9  stations,  lags  1-5  (test  year  1970). 


FORECAST 

METHOD 

DEC 

AM 

JAN 

AM 

FEB 

AM 

P 

42 

30 

3  1 

c 

35 

23 

20 

(P+C)/2 

68 

6  1 

36 

R 

84 

87 

8  1 

TOTAL 

FORECASTS 

1170 

1170 

1035 
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CHAPTER  V 

SUMMARY  AND  CONCLUSIONS 

1.  A  detailed  climatology  can  help  orient  a  forecaster  to  a  new  location.  It 
can  serve  as  the  forecaster's  “experience"  until  they  are  accustomed  to  the  weather 
peculiarities  of  the  area. 

2.  The  4  variable  regression  equation;  which  uses  temperature,  dewpoint 
temperature,  SLP,  and  wind  direction  to  forecast  high  and  low  temperatures  1-5  days 
into  the  future,  showed  a  higher  skill  than  persistence,  climatology,  or  the  average 
of  the  two.  A  better  comparison  would  have  been  to  check  the  developed  model 
against  what  is  currently  being  used  to  forecast  temperatures  on  this  time  scale. 
Unfortunately,  the  author  could  not  find  such  bulletins. 

a)  \2  tests  indicated  the  regression  coefficients  from  specific  stations  must  be 
used  to  forecast  temperature  When  average  values  for  the  coefficients  were 
used,  satisfactory  results  (within  the  specified  tolerance)  were  obtained. 

b)  R2  values  were  typically  0.45-0.65. 

c)  Using  the  arbitrary  2°C  tolerance  for  correct  forecasts,  60-85%  were  judged 
correct  for  the  test  year. 

3.  An  increase  in  forecast  skill  (percent  correct)  can  be  obtained  during  summer 
months  by  including  cross-product  terms  especially: 

a)  wind  direction  by  ceiling  height. 


b)  wind  direction  by  SLP, 
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c)  temperature  by  SLP. 

d)  temperature  by  amount  of  sky  cover,  and 

e)  dewpoint  temperature  by  SLP. 

An  increase  in  skill  may  be  attributed  to  the  interactions  of  the  above  weather 
elements. 

4.  The  2-day  wave  in  temperature  field  was  not  caused  by  frontal  passage 
(FROPA).  FROPA  is  not  a  regular  2-day  event,  rather  a  4-5  day  irregular  event, 
(lie.,  the  average  frequency  of  FROPA  is  on  the  order  of  4-5  days,  but  does  not 
necessarily  occur  every  4th  or  5th  day).  Increases  in  R 2  values  from  previous  lags 
were,  in  fact,  significantly  different  from  each  other.  One  possibility  is  an  upper 
tropospheric  wave  manifesting  itself  in  the  temperature  field.  Such  a  wave  could  be 
quite  regular  and  influence  the  temperature  field  such  that  the  second  and  fourth 
days'  forecast  shows  more  skill  than  the  first  and  fifth  days'  forecast,  respectively. 

a)  Winter  R 2  values  are  higher  than  summer  R2  values  because  there  is  a  smaller 
range  (average  afternoon  temperature  minus  average  morning  temperature)  of 
temperature  in  winter. 

b)  Afternoon  R2  values  are  higher  than  morning  R2  values  throughout  the 
year  because  there  is  less  variability  in  afternoon  temperatures  than  morning 
temperatures. 

c)  Percent  correct  forecasts,  by  lag,  followed  the  general  pattern  of  R2  values, 
that  is,  an  increase  in  forecast  skill  was  noted  on  days  2  and  4  compared  to  days 
1  and  3,  respectively.  Because  the  forecast  skill  on  day  5  decreased  dramatically 
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(from  65%  to  10%  going  from  day  4  to  day  5,  respectively)  using  the  regression 
equation,  persistence  or  the  average  of  climatology  and  persistence  (percent 
correct  forecasts  were  27%  and  26%,  respectively)  should  be  used  to  forecast 
temperatures  on  day  5.  \  definite  conclusion  cannot  be  drawn  from  the  results 
though  due  to  the  facts  that  the  sample  size  was  small  (only  25  days  were 
forecast)  and  there  was  subjectivity  in  the  definition  of  a  "correct"  forecast 
(i.e.,  within  2K  of  the  actual  temperature). 

5.  High  and  low  dewpoint  temperatures  forecast  1-5  days  into  the  future  used 
temperature,  dewpoint  temperature,  and  amount  of  sky  cover  as  the  independent 
variables. 

6.  Combining  climatology  and  persistence  significantly  increased  forecast  skill 
compared  to  climatology  or  persistence  alone  for  dewpoint  temperature  forecasts. 

i.  Comparing  percent  correct  forecasts  for  persistence,  climatology,  and  regres¬ 
sion.  dewpoint  temperature  showed  a  higher  skill  than  the  temperature  forecast  .  One 
reason  is  probably  due  to  less  vaiiability  in  dewpoint  temperature  than  in  tempera¬ 


ture. 
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CHAPTER  VI 

RECOMMENDATIONS 

1.  Choose  a  data  set  that  is  complete,  reliable,  and  easily  accessible.  Needless 
time  will  be  wasted  if  this  advice  is  not  heeded.  Often  times  there  may  be  no  choice 
in  the  matter. 

2.  Fill  in  the  missing  data  via  spatial  and  temporal  interpolation,  utilize  other 
published  data  sources,  and  increase  the  period  of  record.  Introduce  additional 
stations  to  expand  the  study  area. 

3.  Utilize  700  mb  and  500  mb  data  from  computer  tapes  for  the  same  period  of 
record.  Also,  introduce  synoptic  features  into  the  model.  Example:  if  a  cold  front  is 
north  of  the  station,  drop  the  forecasted  temperature  by  t  degrees. 

4.  Expand  the  regression  model  to  forecast  other  weather  variables  such  as  wind 
speed  and  direction,  visibility,  pressure,  and  sky  condition.  Also,  expand  the  forecast 
period  to  each  month  of  the  year. 

5.  Concentrate  on  alternate  modeling  approaches  such  as  non-linear  regression, 
time  series  analysis,  empirical  orthogonal  functions,  etc.  Also  introduce  trend  vari¬ 
ables  in  the  model  ( e.g .,  the  difference  in  SLP  from  yesterday  to  today  may  indicate 
a  low  pressure  system  approaching  the  station).  This  information  will  be  valuable 
and  should  be  included  in  the  forecast  scheme. 

6.  Most  of  the  number  crunching  should  be  done  at  a  centralized  facility  where 
computer  resources  are  handy.  Specific  stations  should  be  tasked  to  define  local 
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topography  and  descriptive  climatology.  The  amount  of  data  (at  least  20  years?)  to 
be  utilized  in  developing  useful  regression  forecast  equations  negates  a  local  weather 
station  from  doing  such  because: 

a)  data  files  probably  do  not  exist  at  this  level  and 

b)  a  large  enough  computer  to  handle  the  data  manipulation  and  statistical 
analysis  probably  is  not  readily  available. 
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APPENDIX  A 


COMPUTER  PROGRAMS  AND  CALCULATIONS 

This  appendix  contains  the  computer  programs  used  to  manipulate  the  data 
and  run  the  regression  equations. 

COMPUTER  PROGRAMS: 

A-l.  Climatology 
A-2.  Linear  regression 

A- 3.  Temperature  forecasts  with  percent  correct  calculations 

A-4.  Comparison  of  full  (specific  coefficients)  to  the  reduced  model  (average 

coefficients)  using  the  TEST  procedure 

A -5.  Comparison  of  climatology,  persistence  in  percent  correct  forecasts 
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A  —  1 


2  .  THIS  PROGRAM  IS  US£C  TC  OEVELOP  A  CLIMATOLOGY  eOR  A  STATION 

3  . * . . . . 

4  data  PENT  A03 ; 

5  INFILE  ini  MISSOVER; 

6  INPUT  ID  YR  MO  OA  HR  OD  SP  SLP  TEMP  DEWPT  PRECIP  SKY  VIS  WX  X  CIG: 


» 


I 


I 


I 


» 


8  IF  1959<«YR<. 1986  AND  MO*12  THEN  YR-YR* 1 ; 

9 

io  . . . .  oefine  seasons  and  times  . . 

i  1  : 

12  IF  MO-i  OR  MO-2  OR  M0*12  THEN  SEASON- ' WINTER ' : 

13  ELSE  IF  MO-6  OR  MO-T  OR  MO-8  THEN  SEASON- ' SUMMER ' ; 

14  ELSE  DELETE; 

15 

16  IF  HR-0  OR  HR-3  THEN  TIME-'AM'; 

17  ELSE  IF  HR-9  OR  HR-12  THEN  TIME-'PM'; 

18  ELSE  DELETE; 

19 

20  ......  DEFINE  PENTADS.  IMPORTANT!  NOTICE  HOW  PENTAD  6  IS  DEFINED.  •**• 

21  ......  THIS  WILL  RESULT  IN  AN  UNEQUAL  NUMBER  OF  OBSERVATIONS  IN  THIS  ••• 

22  ......  GROUP  DEPENDING  ON  THE  NUMBER  OF  DAYS  IN  THE  MONTH!  ............ 

23. 

24  IF  i<«0A<-5  THEN  P-1: 

25.  ELSE  IF  6<-0A<- 10  THEN  P-2; 

26  ELSE  IF  1 1<-0A<- 15  THEN  P-3; 

27  ELSE  IF  16<-OA<-20  THEN  P-4; 

28  ELSE  IF  21<-0A<-25  THEN  P-5; 

29  ELSE  IF  26<-0A<-3l  THEN  P-6: 

30  ELSE  DELETE; 

31  ; 

32  ..........  CALCULATE  MEANS  of  VARIOUS  WEATHER  ELEMENTS  By  MO.  SEASON.  ." 

33  ..........  and  time,  print  the  mean,  number  of  observations  IN  CAL- 

34  ..........  CULATIONS.  MAX  AND  MIN  VALUE  OF  WX  ELEMENT.  VARIANCE.  - . 

35  . —  STANDARD  DEVIATION.  AND  COEFFICIENT  OF  VARIATION.  ......... 

36 

37  PROC  SORT ; 

38  BY  SEASON  P  TIME; 

39  PROC  MEANS  NOPRINT; 

40  BY  SEASON  P  TIME: 

41.  VAR  TEMP  OEWPT  VIS  SKY  SLP  CIG  PRECIP: 

42.  OUTPUT  OUT-NEW  MEAN-TEMP  DEWPT  VIS  SKY  SLP  CIG  PRECIP 

43  N-NUMT  NUMO  NUMV  NUMS  NUMP  NUMC  NUMR 

44  NMISS-TMIS  dmis  vmis  smis  pmis  cmis  rmis 

45  MIN-TMIN  OMIN  VMIN  SMIN  PMIN  CMIN  RMIN 

46  MAX-TMAX  OMAX  VMAX  SMAX  PMAX  CMAX  RMAX 

47  VAR-TVAR  OVAR  WAR  SVAR  PVAR  CVAR  RVAR 

48  std-tsto  ostd  vstd  sstd  pstd  cstd  rsto 

49.  CV-  TCV  OCV  VCV  SCV  PCV  CCV  RCV ; 

50 

51.  . .  DELETE  USELESS  DECIMAL  PLACES  ................. 

52 

53  DATA  TRUNC;  SET  NEW; 

54  TEMP. ROUND (TEMP.  1); 

55  DEWPT -ROUND (DEWPT.  1); 

56  SLP-ROUNO( SLP , . 1 ) ; 

57  VI S -ROUND ( VIS.  11; 

58  CIG-ROUNDICIG.  1): 

59  VMIN-ROUNO( VMIN. . 1 ) . 

60  VMAX-ROUNO( VMAX .  1); 

61  TVAR»R0UN0(TVAR.  1); 

62  DVAR-ROUNO(OVAR.  1); 

63  WAR-ROUND  (  WAR  .  .  1  )  ; 

64  SVAR-ROUND ( SVAR . . 1 ) : 

65  PVAR-ROUNO( PVAR . . 1 ) ; 

66 

67  PROC  PRINT  OATA-TRUNC;  BY  SEASON; 

68  VAR  p  TIME  NUMT  TMIS  TMIN  TEMP  TMAX  TVAR  TSTD  TCV 

69  NUMD  DMIS  OMIN  OEWPT  DMAX  DVAR  DSTD  DCV 

70  NUMV  VMIS  VMIN  VIS  VMAX  WAR  VSTD  VCV 

71  NUMP  PMIS  PMIN  SLP  PMAX  PVAR  PSTD  PCV 

72  NUMC  CMIS  CMIN  CIG  CMAX  CVAR  CSTD  CCV; 
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2  . .  THIS  PROGRAM  DIVIDES  weather  elements  into  categories. 

3  .  WEIGHTS  THEM  AND  THEN  USES  LAG  “UNCTIONS  TO  DEVELOP  --• 

A  .  LINEAR  REGRESSION  EQUATIONS  TO  FORECAST  A  HIGH  AND  LOW 

5  .  TEMPERATURE  1-5  OATS  IN  THE  FUTURE.  .................. 

S  * . . 

7  DATA  DENT  ADO; 

S  INFILE  INI  MISSOVER; 

9  INPUT  id  VR  MO  OA  HR  00  SP  SLP  TEMP  DEWPT  DRECIP  SKY  VIS  WX  X  CIG; 

10  IF  1959<- YR<- 1 986  AND  MO-12  THEN  YR.YR.i- 

11.  IF  YR  NE  1970; 

12  IF  TEMP-.  THEN  DELETE; 

13  IF  OEWPT. .  THEN  DELETE; 

14  IF  DD- .  THEN  DELETE; 

15  IF  SLP-  THEN  DELETE; 

16 

17.  IF  MO-1  OR  MO-2  OR  MO-12  THEN  SEASON- ' WINTER ' ; 

18.  ELSE  IF  MO-6  OR  MO-7  OR  MO-8  THEN  SEASON- ' SUMMER ' ; 

19.  ELSE  OELETE ; 

20.  ; 

21  IF  HR-0  OR  HR-3  THEN  TIME-'AM'; 

22.  ELSE  IF  HR-9  OR  HR-12  THEN  TIME-'PM'; 

23.  ELSE  DELETE; 

24 

25  PROC  SORT  ; 

26  BY  SEASON  OA  TIME; 

27  PROC  MEANS  NOPHINT; 

28  BY  SEASON  OA  TIME; 

29  VAR  00  SP  TEMP  OEWPT  PRECIP  SKY  VIS  WX  CIG; 

30  OUTPUT  OUT-NEW  MEAN-00  SP  TEMP  DEWPT  PRECIP  SKY  VIS  WX  CIG; 

31  DATA  EST; 

32  SET  NEW; 

33  BY  SEASON  OA  TIME: 

34 

35  ..........  DIVIDE  THE  WINO  DIRECTION  (DD)  INTO  THE  8  POINTS  OF  •* 

36  . .  THE  COMPASS  THEN  WEIGHT  THE  DIRECTIONS  (HIGHER  IF  ••* 

37  ..........  DO  IS  FROM  A  SOUTHERLY  DIRECTION.  LOWER  IF  00  IS  FROM 

38  . .  i  NORTHERLY  DIRECTION.  .............................. 

39 

40  IF  0<“0D<»22 . 5  ANO  SP>«2  THEN  QUAD" 1 : 

41  IF  342 . 5«-00<-360  ANO  SP>*2  THEN  OUAD-1; 

42  ELSE  IF  22.5<0D<-67.5  ANO  SP»2  THEN  OUAD-2: 

43.  ELSE  IF  67 . 5<0D<- 112.5  ANO  SP>-2  THEN  QUAD-4 ; 

44  ELSE  IF  112. 5<0D<* 157 . 5  ANO  SP>"2  THEN  0UAD«6; 

45  ELSE  IF  157 . 5<00<"202 . 5  ANO  SP>*2  THEN  0UA0-8: 

46  ELSE  IF  202 . 5<00<-247 . 5  ANO  SP>*2  THEN  0UA0-7 ; 

47  ELSE  IF  247 . 5<00<«292 . 5  AND  SP»*2  THEN  OUAD-5:- 

48  ELSE  IF  292 . 5<0D<«342 . 5  ANO  SP»*2  THEN  QUAD-3; 

49  ELSE  IF  SP<2  THEN  QUAO-O: 

50 

51.  ..........  01 VIDE  THE  CEILING  HEIGHT  (CIG)  INTO  LOW.  MIDDLE.  ANO 

52  . -  WEIGHT  ACCORDINGLY  . . . . 

53  . 

54  IF  0<*CIG<-56  THEN  C-10; 

55  ELSE  IF  57<.CIG<*69  THEN  C-25; 

56  ELSE  IF  70<-CIG<*89  THEN  C-50: 

57  ELSE  IF  90<«CIG<-97  THEN  C-10: 

58  ELSE  OELETE; 

59 

SO  ......................  WEIGHT  THE  SKY  COVER  (SKY)  ........... 

6  1 

62  IF  SKY *0  THEN  CLD-10; 

63  ELSE  IF  SKY  - 1  THEN  CLD-10; 

64.  ELSE  IF  SKY-2  THEN  CLD-15; 

65.  ELSE  IF  SKY- 3  THEN  CL0-20: 

66  ELSE  IF  SKY. 4  THEN  CLD-35: 

67  ELSE  IF  SKY-5  THEN  CLD»50; 

68  ELSE  IF  SKY -6  THEN  CLD-70: 

69  ELSE  IF  SKY-7  THEN  CLD-80; 

70  ELSE  IF  SKY-8  THEN  CLD*90 ; 

71  ELSE  IF  SKY. 9  THEN  CLO-IOO: 

72 

73  . .  OIVIDE  THE  WEATHER  INTO  GENERAL  CATEGORIES  (RAIN. 

74  ..........  SHOWERS.  SNOW.  )  ANO  WEIGHT.  ..................... 

75 

76  IF  4<«WX<-9  THEN  W-10: 

77  ELSE  IF  30<-WX<-39  THEN  W-10: 

78  ELSE  IF  40< -WX < .49  THEN  w-20: 

79  ELSE  IF  50<*WX<.59  THEN  w-45 : 


80  ELSE  IE  60<»WX<*69  THEN  w-60; 

81.  ELSE  IE  70<»WX<»79  THEN  w-80; 

82  ELSE  IE  80<-WX<-94  THEN  w-65  : 

83.  ELSE  IE  95<-WX<*99  THEN  w-90: 

84 

85  ..........  CREATE  A  DATA  SET  WHICH  WILL  BE  USED  TO  STORE  THE  LAG 

8S  ..........  variables,  next  create  data  subsets  eor  each  lag  to 

87  ..........  include  each  weather  element  ...................... 

38 

89.  DATA  ONE; 

90  DO  YR* 1959  TO  1987; 

91 .  DO  MO* 1 .2.6. 7,8. 12; 

92 .  DO  DA*1  TO  31 ; 

93  00  HR-O.3.9. 12; 

94  OUTPUT; 

95 .  END ; 

96  .  END  ; 

97  END ; 

98.  END; 

99. 

100.  PROC  SORT  OATA-ONE;  BY  YR  MO  DA  HR: 

101.  PROC  SORT  0ATA-PENTA01 1 ;  BY  YR  MO  DA  HR; 

102.  DATA  TWO: 

103.  UPDATE  ONE  PENTAD  1 1 : 

104  BY  YR  MO  DA  HR; 

105 .  ; 

106  OATA  LI  : 

107.  SET  TWO; 

108  BY  YR  MO  OA  HR; 

109  T 1 -LAG( TEMP ) : 

110  P 1 *LAG( SLP ) ; 

111  Di*LAG(OUAD ) : 

112  SI-LAG(SP): 

113  DP1«LAG(0EWPT ) ; 

114  W 1  *  LAG( W ) ; 

115  C 1 *LAG( C ) : 

116  XI-LAG(CLO): 

117.  IF  DA-1  THEN  DELETE: 

118 

119  PROC  SORT  OAT A- Li ; 

120  BY  SEASON  TIME: 

121 

122  DATA  L2; 

123  SET  TWO: 

124  BY  YR  MO  DA  HR; 

125  T2-LAG2I TEMP ) : 

126  P2-LAG2ISLP! : 

127  02-LAG2I OUAD ) ; 

128  S2-LAG2ISP); 

129  DP2-LAG2I0EWPT): 

130  W2-LAG2I W ) : 

131  C2*LAG2(C); 

132  X2-LAG2(CL0): 

133  IF  1 <-0A<-2  THEN  OELETE ; 

134 

135  PROC  SORT  0ATA-L2 ; 

136  BY  SEASON  TIME: 

137 

138.  DATA  L3 : 

139  SET  TWO; 

140  BY  YR  MO  OA  HR; 

14  1,  T3-LAG3I TEMP  I ; 

142  P3*LAG3(SLP); 

143.  03-LAG3I OUAD ) ; 

144  S3-LAG3! SP ) : 

145  DP3-LAG3(DEWPT) ; 

146  W3-LAG3(W); 

147  C3-LAG3I C ) : 

148.  X3-LAG3I OLD ) ; 

149.  IF  1 <-0A<-3  THEN  OELETE: 

150 

151  PROC  SORT  DATA-L3 ; 

152  BY  SEASON  TIME: 

153  . 

154  DATA  L4; 

155  SET  TWO; 

156  BY  YR  MO  OA  HR; 

157  T4 - LAG4 ( TEMP ) : 

158  P4-LAG41 SLP ) ; 

159  D4-LAG410UAO ) : 
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'SO  S4-LAG4I SP ) . 

'61  0P4"LAG4(DEWPT ) ; 

'SO  w4»lAG4( w ) : 

'63  C4«LAG4(C); 

'64  X 4  a L AG4 ( CLO ) ; 

'65  IF  1 < »OA< *4  THEN  DELETE; 

'  66 

'67  PROC  SORT  0ATA-L4; 

'68  BY  SEASON  TIME ; 

169 

170  DATA  L5 ; 

171  SET  TWO; 

'72.  BY  YR  MO  OA  HR; 

'73.  T5«LAG5(TEMP ) ; 

'  74 .  P5-LAG51SLP) : 

175  05*LAG5(OUAO ) : 

'76  S5-LAG5(SP); 

'77  0P5*LAG5(DEWPT) ; 

’78.  W5«LAG5(W); 

'79.  C5«LAG5(C); 

'80  X5-LAG5(CL0) ; 

’81  IF  1<*0A<«5  THEN  DELETE: 

182  . 

133.  PROC  SORT  OATA«lS: 

'84.  BY  SEASON  TIME; 

185 

186  .  IMPLEMENT  A  linear  REGRESSION  PROCEDURE  (LEAST  SOUARES) 

187  . .  THE  FUTURE  TEMPERATURE  (TEMP)  IS  CORRELATED  TO  THE  TEMP  --- 

188  ..........  ( T ) .  DEWPOINT  TEMP  (DP),  SLP(P).  AND  WIND  DIRECTION  ( DD )  •* 

'89  . .  N  0AYS  PREVIOUS.  WHERE  N  •  LAG.  . . ......I... 

190. 

191  PROC  SORT  DATA«L1; 

192  BY  SEASON  TIME; 

193  PROC  REG  DATA.L1  ; 

'94  MODEL  TEMP-T1  DPI  PI  01; 

195  PROC  REG  DATA-L2  ; 

196  MODEL  TEMP-T2  DP2  P2  D2; 

197 .  PROC  REG  DATa*L3  ; 

198  MOOEL  TEMP.T3  0P3  P3  D3 ; 

199  PROC  REG  DATA-L4  ; 

200.  MODEL  TEMP-T4  0P4  P4  04; 

201  PROC  REG  DATA-L5  ; 

202.  MODEL  TEMP-T5  0P5  P5  OS; 
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2  ..........  THIS  PROGRAM  TAKES  the  REGRESSION  COEFFICIENTS  FOR  A  -------- 

3  ..........  PARTICULAR  weather  element  and  predicts  the  temperature  ---- 

4  ..........  ,.5  QAYS  IN  THE  FUTURE.  The  PERCENT  CORRECT  (%)  FORE-  ..... 

5.  . .  CASTS  ARE  CALCULATED  TOO.  . . . . 

6 .  . . . . . . . 

7  .  ; 

8  DATA  PENT ADZ ; 

9  INFILE  INI  MISSOVER: 

10  INPUT  ID  YR  MO  DA  HR  DO  SP  SLP  TEMP  OEWPT  PRECIP  SKY  VIS  WX  X  CIG: 

11  IF  1959<*YR<*1986  AND  M0*12  THEN  YR-YR* 1 ; 

12 .  ; 

13.  IF  YR.1970: 

14  . 

IS.  . .  BECAUSE  THE  FORECAST  PERIOO  IS  S  OAYS  INTO  THE  FUTURE  . 

16  ..........  AND  ONLY  ’COMPLETE'  FORECASTS  (THE  ENTIRE  S  DAY  PERIOO)  **** 

17  ..........  ARE  GOING  TO  BE  ANALYZED.  THE  INITIAL  FORECAST  DAY  WILL 

18.  ..........  END  5  DAYS  PRIOR  TO  THE  END  OF  THE  MONTH.  .................. 

19. 

20.  IF  MO-1  OR  MO-7  OR  M0*8  OR  M0« 12  THEN  DO: 

21 .  IF  0A<27 ; 

22.  ENO: 

23.  IF  MO-2  THEN  DO; 

24.  IF  DA <2 4 ; 

25.  END: 

26  IF  M0*6  THEN  00: 

27  IF  0A<26; 

28.  END: 

29. 

30.  IF  MO-1  OR  MO-2  OR  M0«12  THEN  SEASON* 'WINTER ' : 

31.  ELSE  IF  M0*6  OR  MO-7  OR  M0*8  THEN  SEASON* ' SUMMER • ; 

32  ELSE  DELETE: 

33  : 

34  IP  HR-0  OR  H«*3  THEN  TIME* 'AM'; 

35.  ELSE  IF  HR*9  OR  MR-12  THEN  T1ME*'PM': 

36.  ELSE  DELETE: 

37  ; 

38.  IF  0<*00<*22.5  AND  SP»*2  THEN  OUAD-1: 

39.  IF  342. 5< *00"* 360  AND  SP»*2  THEN  QUAD-1; 

40  ELSE  IF  22 . 5<D0<-67 . 5  AND  $P>*2  THEN  0UA0*2: 

41.  ELSE  IF  67. 5<00<* 112.5  AND  SP»*2  THEN  OUAD-4; 

42.  ELSE  IF  112. 5<00<*157 . 5  AND  SP>*2  THEN  OUAD-6 ; 

43  ELSE  IF  157 . 5<DD<*202 . 5  AND  SP>-2  THEN  OUAD-B : 

44  ELSE  IF  202 . 5<0D<*247 . 5  ANO  SP»*2  THEN  0UA0*7; 

45.  ELSE  IF  247 . 5<0D<*292 . 5  AND  SP»*2  THEN  OUAD-5 : 

46.  ELSE  IF  292 . 5<00<*342 . 5  ANO  SP>*2  THEN  0UAD*3: 

47.  ELSE  IF  SP<2  THEN  QUA0*O; 

48. 

49  .  PROC  SORT ; 

50.  8Y  MO  OA  TIME: 

51.  PROC  MEANS  NOPRINT ; 

52.  BY  MO  OA  TIME ; 

33  VAR  TEMP  OEWPT  QUAD  SLP; 

54  OUTPUT  OUT*NEW  MEAN-TEMP  OEWPT  OUAO  SLP: 

55 

56  OATA  EST; 

57  SET  NEW; 

38.  BY  MO  OA  TIME; 

39 

60.  IF  MO* 1  OR  MO-2  OR  MO* 12  THEN  SEASON* ' WINTER ' ; 

61.  ELSE  IF  M0*6  OR  M0*7  OR  M0*8  THEN  SEASON* ' SUMMER ' ; 

62.  ELSE  DELETE: 

63 

64  ..........  THE  COEFFICIENTS  FOR  THE  PARTICULAR  WEATHER  ELEMENTS  •••••— 

65.  ..........  ARE  ENTERED  FROM  THE  LINEAR  REGRESSION  PROGRAM.  BY 

66  ..........  SEASON  AND  TIME  . . . . . 

67 

68.  IF  SEASON* ' SUMMER '  ANO  TIME* 'AM'  THEN  DO; 

69.  FCST 1*187  -  0.238-TEMP  *  0.334-0EWPT  -  0.1B8-0UA0  -  0.059-SLP: 

70.  FCST2-130  *  0.487-TEMP  *  0.165-DEWPT  -  0.O65-OUAD  -  0.028-SLP: 

71.  FCST3* 13 1  *  0. 300-TEMP  ♦  0.243-0EWPT  -  0.18O-0UA0  -  0.016-SLP: 

72  FCST4*  37  *  0. 587-TEMP  *  0.121-0EWPT  -  0.153-0UAD  ♦  0.048-SLP; 

73  FCST5-134  *  0.174-TEMP  *  0.268-0EWPT  -  0.115-0UA0  *  0.010-SLP; 

74,  END: 

73  . 

76  IF  SEASON* 'SUMMER'  ANO  TIME* 'PM'  THEN  00; 

77  FCST 1 •  54*  0  451 -TEMP  *  0.263-DEWPT  -  0.283-0UAD  *  0.039-SLP: 

78  FCST2*-  89*  0.690*TEMP  *  0.184*DEWPT  -  0.186-0UA0  *  0.130-SLP: 

79  FCST3*  72*  0.234-TEMP  *  0.283-0EWPT  -  0. 229-QUAD  *  0.079-SLP: 


80  FCST4*  5-  0  621-TEMP  -  0  063-DEWPT  -  0.096-OUAD  *  0  091-SLP: 

81  POSTS*  8*  0  343-TEMP  -  0.212-0EWPT  -  0.298-QUAD  *  0.131-SLP: 

82  END. 

83  . 

84  If  SEASON* ' WINTER '  AND  T I  ME  * ' AM '  THEN  DO: 

85  POST  1  *  84  -  0.389-TEMP  *  0.494-OEWPT  »  O.OS2-OUAO  -  0.033-SLP; 

86  PCST2*  32  *  0  464-TEMP  *  0.4S5-DEWPT  -  0.2O5-0UAD  -  0.010-SLP; 

87  PCST3*  66  *  0.255-TEMP  *  0.481-0EWPT  *  0.233-QUAD  *  0.003-SLP; 

88  FCST4*  31  -  0.484-TEMP  -  0.251-OEWPT  -  0.172-0UAD  ♦  0.038-SLP: 

89  FCST5*  33  *  0.346-TEMP-  0.374-DEWPT  ♦  0  176-QUAD  *  0.041-SLP' 

90  END ; 

91 

92  IF  SEASON- 'WINTER '  AND  TIME-'PM'  THEN  DO; 

93.  POST  1 -  38  -  0.528-TEMP  -  0.221-DEWPT  -  0.008-QUAD  *  0.031-SLP; 

94  FCST2*  19  -  0 . 498-TEMP  *  0.190-0EWPT  ♦  0.014-QUAD  ♦  0.066-SLP: 

95  FCST3-  39  *  0.309-TEMP  -  0.314-0EWPT  -  0.164-QUAD  *  0.063-SLP; 

96  PCST4--25  -  0. 623-TEMP  ♦  0.123-OEWPT  *  0.193-QUAD  *  0.091-SLP; 

97,  FCST5-  27  -  0.348-TEMP  ♦  0.215-0EWPT  *  0.155-QUAD  -  0.090-SLP- 

98  .  END : 

99. 

’00.  . .  4  RESIDUAL  IS  CALCULATED  FORM  the  ACTUAL  TEMPERATURE  •* 

101.  . .  and  THE  FORECASTED  TEMPERATURE.  IF  the  residual  IS  LESS 

102.  ..........  THAN  2  DEO  C.  THEN  THE  FORECAST  IS  JUDGED  CORRECT. 

103  . 

104  R 1 -FCST 1 -TEMP ; 

105  R2-FCST2-TEMP; 

106.  R3-FCST3-TEMP; 

107  R4-FCST4-TEMP ; 

108.  R5-FCST5-TEMP; 

109 

1  10  ARRAY  num  R1-R5: 

111  array  NUMCOR  N1-NS; 

112.  DO  I  ■  1  TO  5: 

13.  IF  NUM-  THEN  DELETE: 

114  IF  ABS( NUM )<■  2  THEN  NUMCOR- 1; 

115.  ELSE  IF  ABS ( NUM }  >2  "HEN  NUMCOR-O: 

116  SUMi-SUM(OF  N1-N5): 

117  END : 

118 

119  PROC  SORT; 

120  BY  MQ  TIME; 

121  . 

122.  PROC  PLOT; 

123  BY  MO  TIME: 

124  PLOT  R1-0A-' 1 > 

125  R2-0A- ' 2 ' 

126  R3-0A- ■ 3 ' 

127  R4-0A--4- 

128  R5-DA-'5’/VREF«2.0.-2  OVERLAY; 

129 

130  PROC  SORT; 

131  BY  MO  TIME; 

132  PROC  MEANS  NOPRINT; 

133  BY  MO  TIME; 

134  VAR  N1-N5  SUM1 ; 

135  OUTPUT  OUT -SET  1  SUM-M 1 -M5  SUM  1  N-P1-P5  SU 1 ; 

136 

137  . 4  COUNT  IS  KEPT  OF  THE  NUMBER  OF  CORRECT  FORECASTS.  - 

138.  ..........  NUMBER  OF  INCORRECT  FORECASTS,  ANO  THE  TOTAL  NUMBER  OF 

139  ..........  FORECASTS .  THE  PERCENT  CORRECT  (%)  IS  THEN  OUTPUT  • 

140 . 

141 .  DATA  FINAL; 

142.  SET  SET  1  ; 

143.  TOTFCSTS-SUM( OF  P1-P5); 

144  PROC  SORT; 

145  BY  MO  TIME : 

146.  PROC  MEANS  NOPRINT ; 

147  BY  MO  TIME ; 

148.  VAR  TOTFCSTS; 

149.  OUTPUT  0UT-SET2  SUM-TOTFCSTS ; 

150  DATA  PERCENT: 

151  MERGE  SET  1  SET2  : 

152  BY  MO  TIME: 

153 .  PCNTCOR- ( ( SUM  1 ) /TOTFCSTS 1*100; 

154  PROC  SORT; 

155  BY  MO  TIME: 

156  PROC  PRINT; 

157  8Y  MO  TIME: 

158  VAR  SUM  1  TOTFCSTS  PCNTCOR; 
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.  THIS  BLOCK  OF  COOE  7AKES  the  REGRESSION  COEFFICIENT  ------- 

.  FROM  THE  PARTICULAR  LAG  AND  ENTERS  THEM  AS  CONSTANTS  . 

.......  (OUTPUT  is  FROM  linear  REGRESSION  PROGRAM)  THE  TEST 

.  RESTRICT  PROCEDURE  THEN  COMPARES  THE  RESTRICTED  MODEL  . 

■ .  USING  THE  CONSTANT  REGRESSION  COEFFICIENTS  TO  THE  FULL  ---• 

.  model,  an  f-statistic  is  generated  to  allow  determin-  ---• 

.......  ation  of  which  set  of  coefficients  provide  a  better  . 

. .  PREDICTOR  (FULL  VS  REDUCED  MODEL),  . . . 

PROC  REG  DATA-L 1 ; 

8V  SEASON  TIME ; 

MOOEL  TEMP.T1  DPI  Dl  Pi; 

IF  SEASON- • SUMMER '  AND  TIME-'AM'  THEN 

RESTRICT  T1« . 22. DPI- . 32.01*- .  18  ,  PI*-  .  09  : 

ELSE  IF  SEASON* ' SUMMER '  AND  TIME-'PM'  THEN 
RESTRICT  T1*  4G.0P1- .26.01*-  27.P1-  05- 
ELSE  OELETE ; 
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1  ----------  THIS  SEQUENCE  OF  CODE  USES  CLIMATOLOGY.  PERSISTENCE  ------- 

2  ..........  AND  The  average  of  clim.  AND  PERSIST,  to  CALCULATE  . 

3.  ..........  the  %  CORRECT  FORECASTS.  -------- . . . 

4 

5.  - - ...........  CLIMATOLOGICAL  values  of  TEMPERATURE 

6 

?  IF  MO* 1  AND  TIME* ' AM '  THEN  SAMC*266.5; 

8  ELSE  IF  MO  *  1  AND  TIME-'PM'  THEN  SAMC-270.2; 

9  ELSE  IF  MO-2  AND  TIME* ' AM'  THEN  SAMC-266  1 ; 

10.  ELSE  IF  MO-2  AND  TIME-'PM'  THEN  SAMC-270.9; 

11  ELSE  IF  MO-6  AND  TIME- 'AM'  THEN  SAMC-291.6; 

12.  ELSE  IF  MO-6  AND  TIME-'PM'  THEN  SAMC-300.1; 

13.  ELSE  IF  MO-7  AND  TIME- ' AM '  THEN  SAMC-293.8; 

14  ELSE  IF  MO-7  AND  TIME-'PM'  THEN  SAMC-303.0: 

15.  ELSE  IF  MO-8  AND  TIME-'AM'  THEN  SAMC-291.7; 

16  ELSE  IF  MO-8  AND  TIME-'PM'  THEN  SAMC-301.5: 

17.  ELSE  IF  MO-12  AND  TIME-'AM'  THEN  SAMC-270.1; 

18.  ELSE  IF  MO-12  AND  TIME-'PM'  THEN  SAMC-272.7; 

19 

20  ..........  THE  OIF  FUNCTION  SERVES  AS  PERSITENCE .  DIF  TAKES  ---------- 

21.  ..........  THE  FORECAST  DAY'S  TEMPERATURE  ANO  CARRIES  THE  SAME  -------- 

22.  ..........  VALUE  IN  THE  FUTURE.  RESIDUALS  ARE  COMPUTED  FROM  ----- . 

23  ..........  THIS  VALUE  AND  THE  ACTUAL  TEMPERATURE  ON  THAT  LAG .  --------- 

24  . 

25  SAMP  1 -01 F 1 ( TEMP  )  ; 

26  SAMP2-0I F2( TEMP  )  ; 

27.  SAMP3-0IF3( TEMP ) ; 

28.  SAMP4-DIF4(TEMP  )  ; 

29.  SAMP5-0I F5 ( TEMP  )  ; 

30. 

31  . . .  RESIDUALS  OF  CLIMATOLOGY  ARE  CALCULATED  ------ . 

32. 

33.  RRl-SAMC-TEMP; 

34  RR2-SAMC-TEMP; 

35.  RR3-SAMC-TEMP: 

36.  RR4-SAMC-TEMP: 

37.  RR5-SAMC-TEMP; 

38 

39  THE  AVERAGE  RESIOUALS  OF  CLIMO  ANO  PERSISTENCE  ARE  CALCULATED  — 

40. 

4  1.  R  1  ■  (  RR  1+SAMP  1  )/2  : 

42  R2- ( RR2+SAMP2 )/2 ; 

43.  R3-(RR3^SAMP3 )/2 ; 

44  R4-( RR4+SAMP4 )/2; 

45  R5-(RR5*SAMP5)/2; 

46  . 

47  ARRAY  NUM  R1-R5; 

48.  ARRAY  NUMCOR  N1-N5; 

49.  DO  _I  -  1  TO  5; 

5;.  IF  NUM- .  THEN  DELETE; 

51  IF  ABS(NUM)<-  2  THEN  NUMCOR- 1; 

52.  ELSE  IF  ABS(NUM)  >2  THEN  NUMCOR-O; 

53.  SUM 1 -SUM( OF  N1-N5); 

54 .  END ; 

55  . 

56  PROC  SORT ; 

57  BY  MO  TIME; 

58  PROC  MEANS  NOPRINT; 

59  BY  MO  TIME; 

60.  VAR  N 1 -N5  SUM1 : 

61  OUTPUT  OUT -SET  1  SUM-MI -M5  SUM  1  N-P1-P5  SU1 : 

62.  DATA  FINAL. 

63  SET  SET  1 ; 

64  TOTFCSTS-SUM(OF  P1-P5); 

65  PROC  SORT; 

66.  BY  MO  TIME; 

67  PROC  MEANS  NOPRINT; 

68  BY  MO  TIME; 

69  VAR  TOTFCSTS; 

70.  OUTPUT  0UT-SET2  SUM-TOTFCSTS ; 

71  OATA  PERCENT; 

7 2  MERGE  SET i  SET2 ;  BY  MO  TIME; 

73  PCNTCOR-( (SUM1 )/T0TFCST$)-1OO: 

74  PROC  SORT; 

75  BY  MO  TIME; 

76  PROC  PRINT; 

77  BY  MO  TIME; 

78  VAR  SUM  1  TOTFCSTS  PCNTCOR : 
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CLIMATOLOGICAL  DATA  FOR  ASTRAKHAN.  RS 

STATION:  ASTRAKHAN.  RS 
LOCATION:  46°  21'  N  47°  59'  E 
ELEVATION:  13  m 
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CLIMATOLOGICAL  TABLES  FOR  ASTRAKHAN.  RS 


B-l.  (general  climatology  by  month  and  time 

B-2:  Temperature  by  month 

B-3:  Temperature  by  month  and  pentad 

B-4:  Temperature  by  month  and  hour 

B-5:  Dewpoint  temperature  by  month 

B-b:  Relative  humidity  by  month  and  hour 

B-7:  Sea  level  pressure  iSLPl  by  month 

B-8:  SLP  by  month  and  pentad 

B-9:  SLP  by  month  and  hour 

B- 10:  Wind  speed  and  wind  direction  by  month 

B-ll:  Wind  speed  and  wind  direction  by  month  and  pentad 

B-12:  Wind  speed  and  wind  direction  by  month  and  hour 

B-13:  Visibility  by  month  and  pentad 

B-14:  Visibility  by  month  and  hour 

B-15:  Ceiling  height  and  sky  cover  by  month 

B-16:  Celling  height  and  sky  cover  by  month  and  pentad 

B-17:  Ceiling  height  and  sky  cover  by  month  and  hour 

B-18:  Weather  types  by  month 

B- 19:  Weather  types  by  month  and  hour 


ASTRAKHAN,  RS 
CLIMATOLOGICAL  DATA 
PERIOD  OF  RECORD:  1959-1986 
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Calculations  based  on  a  range  of  5446-5780  observations  per  month  with  the  exception  of  February  which  was  based  on 
5369  observations.  Standard  Deviation  of  wind  speed  data  ranged  from  3.3-7. 1  kts  per  cell. 

Standard  Deviation  of  percentages:  2%  per  cell.  Coefficient  of  variation  of  wind  speed  ranged  from  41-76%  per  cell 
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ASTRAKHAN,  RS 

PERCENTAGE  OCCURRENCE  OF  VISIBILITY  CLIMATOLOGY 
BY  MONTH  AND  HOUR 
PERIOD  OF  RECORD:  1959-1986 


Calculations  based  on  a  range  of  667-805  observations  per  cell. 

Standard  Deviation  of  data:  0.1%  per  cell. 

The  greatest  variability  occurred  during  the  winter  months.  Throughout  the  entire 
the  hours  of  greatest  variability  in  visibility. 
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APPENDIX  C 


ORENBURG, RA 
CLIMATOLOGICAL  DATA 
PERIOD  OF  RECORD:  1959-1986 
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APPENDIX  D 


CLIMATOLOGICAL  DATA  FOR  KALMIKOVO.  RA 
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LOCATION:  49°  03’  N  51°  53’  E 
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CLIMATOLOGICAL  DATA  FOR  GURYEV.  R 
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LOCATION:  47°  01'  N  51 :  52'  E 
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APPENDIX  F 


CLIMATOLOGICAL  DATA  FOR  RYAZAN.  RS 

STATION:  RYAZAN.  RS 
LOCATION  54c  37'  N  39"  44'  E 
ELEVATION:  135  m 
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CLIMATOLOGICAL  DATA  FOR  PENZA.  RS 


STATION.  PENZA.  RS 
LOCATION:  .53°  08'  N  45:  02‘  E 
ELEVATION:  235  m 
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APPENDIX  H 


CLIMATOLOGICAL  DATA  FOR 
KUYBYSHEV/KURUMOCH.  RA 


STATION:  KUYBVSHEV  KURL'MOCH.  RA 
LOCATION:  53 ;  15'  N  50°  28'  E 
ELEVATION-  44  m 


O  ROSTOV -ON-DON 


A5TRAK  rt»N 


KUYBYSHEV/KURUM0C1I,  RA 
CLIMATOLOGICAL  DATA 
PERIOD  OF  RECORD:  1959-1986 
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